A Novel Architecture of Agent based Crawling for OAI Resources

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 4

Abstract

Most search engines compete to index as much of the Surface Web as possible while leaving OAI content (PDF documents) in the lurch, even though it holds more information than the Surface Web. This paper proposes a novel framework for an OAI-PMH based crawler that uses agents to extract metadata about OAI resources and store it in a repository, which is later queried through the OAI-PMH layer to generate XML pages containing the metadata. These pages are then added to the search engine's repository for indexing, which in turn increases the relevancy of the search engine. Agents are used to parallelize the whole process so that metadata extraction from multiple resources can be carried out simultaneously.
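
The pipeline described in the abstract (parallel agents extract metadata, a repository stores it, an OAI-PMH layer renders it as XML) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the names RESOURCES, extract_metadata, crawl and list_records_xml are hypothetical, the "agents" are modeled as plain worker threads, and the PDF parsing step is replaced by in-memory sample data. The XML namespaces are the standard OAI-PMH 2.0 and Dublin Core ones, but the generated document omits required protocol elements such as responseDate, request and datestamp, so it is not a complete OAI-PMH response.

```python
from concurrent.futures import ThreadPoolExecutor
import xml.etree.ElementTree as ET

# Standard namespaces used by OAI-PMH 2.0 and unqualified Dublin Core.
OAI_NS = "http://www.openarchives.org/OAI/2.0/"
OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("oai_dc", OAI_DC_NS)
ET.register_namespace("dc", DC_NS)

# Hypothetical stand-in for PDF documents discovered by the crawler.
RESOURCES = [
    {"url": "http://example.org/a.pdf", "title": "Paper A", "creator": "Author One"},
    {"url": "http://example.org/b.pdf", "title": "Paper B", "creator": "Author Two"},
]


def extract_metadata(resource):
    """One 'agent': produce Dublin Core-style fields for a single resource.

    Assumption: a real agent would download and parse the PDF; here the
    fields are simply copied from the sample data.
    """
    return {
        "identifier": resource["url"],
        "title": resource["title"],
        "creator": resource["creator"],
    }


def crawl(resources, max_agents=4):
    """Run the agents in parallel and store their results in a repository.

    The repository is modeled as a dict keyed by resource identifier.
    """
    with ThreadPoolExecutor(max_workers=max_agents) as pool:
        records = list(pool.map(extract_metadata, resources))
    return {record["identifier"]: record for record in records}


def list_records_xml(repository):
    """Render the repository as a simplified ListRecords-style XML page."""
    root = ET.Element(f"{{{OAI_NS}}}OAI-PMH")
    list_records = ET.SubElement(root, f"{{{OAI_NS}}}ListRecords")
    for identifier in sorted(repository):
        fields = repository[identifier]
        record = ET.SubElement(list_records, f"{{{OAI_NS}}}record")
        header = ET.SubElement(record, f"{{{OAI_NS}}}header")
        ET.SubElement(header, f"{{{OAI_NS}}}identifier").text = identifier
        metadata = ET.SubElement(record, f"{{{OAI_NS}}}metadata")
        dc = ET.SubElement(metadata, f"{{{OAI_DC_NS}}}dc")
        for name in ("title", "creator", "identifier"):
            ET.SubElement(dc, f"{{{DC_NS}}}{name}").text = fields[name]
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(list_records_xml(crawl(RESOURCES)))
```

Threads are used here only because metadata extraction is typically I/O-bound; the paper's agents could equally be separate processes or machines, which this sketch does not model.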

Authors and Affiliations

Shruti Sharma, J. P. Gupta, A. K. Sharma

How To Cite

Shruti Sharma, J. P. Gupta, A. K. Sharma (2010). A Novel Architecture of Agent based Crawling for OAI Resources. International Journal on Computer Science and Engineering, 2(4), 1190-1195. https://europub.co.uk/articles/-A-91883