A Novel Architecture of Agent based Crawling for OAI Resources

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 4

Abstract

Nowadays, most of the search engines are competing to index as much of the Surface Web as possible with leaving a lurch at the OAI content (pdf documents), which holds a huge amount of information than surface web. In this paper, a novel framework for OAI-PMH based Crawler is being proposed that uses agents to extract the metadata about the OAI resources nd store them in a repository which is later on queried hrough he OAI-PMH layer to generate the XML pages ontaining the metadata. These pages are further added to the search gines repository for indexing that makes in turn increases the relevancy of Search Engine. Agents are being used to rallelize the whole process so that metadata extraction from multiple resources can be carried out simultaneously.

Authors and Affiliations

Shruti Sharma , J. P. Gupta , A. K. Sharma

Keywords

Related Articles

Iris recognition based on subspace analysis

Biometrics deals with the uniqueness of an individual arising from their physiological or behavioral characteristics for the purpose of personal identification. Among many biometrics techniques, iris recognition is one o...

Metric for Early Measurement of Software Complexity

Software quality depends on several factors such as on time delivery; within budget and fulfilling user's needs. Complexity is one of the most important factors that may affect the quality. Therefore, measuring and contr...

An Analysis on Preservation of Privacy in Data Mining

Privacy has become a key issue for progress in data mining. Maintaining the privacy of data mining has become ncreasingly popular because it allows sharing of privacy-sensitive data for analysis. So people are still rel...

Image Mining using Content Based Image Retrieval System

The image depends on the Human perception and is also based on the Machine Vision System. The Image Retrieval is based on the color Histogram, texture. The perception of the Human System of Image is based on the Human Ne...

Identity Recognizing using Iris Scan with multiple frames of Video

The field of iris biometrics has been significant research over the last decade. At this point, iris capture has become a main stream technology with wide acceptance. A general iris recognition system works with four dif...

Download PDF file
  • EP ID EP91883
  • DOI -
  • Views 97
  • Downloads 0

How To Cite

Shruti Sharma, J. P. Gupta, A. K. Sharma (2010). A Novel Architecture of Agent based Crawling for OAI Resources. International Journal on Computer Science and Engineering, 2(4), 1190-1195. https://europub.co.uk/articles/-A-91883