A Novel Architecture of Agent based Crawling for OAI Resources

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 4

Abstract

Most search engines compete to index as much of the Surface Web as possible while neglecting OAI content (PDF documents), which holds a larger amount of information than the Surface Web. This paper proposes a novel framework for an OAI-PMH based crawler that uses agents to extract metadata about OAI resources and store it in a repository, which is later queried through the OAI-PMH layer to generate XML pages containing the metadata. These pages are then added to the search engine's repository for indexing, which in turn increases the relevancy of the search engine. Agents are used to parallelize the whole process so that metadata extraction from multiple resources can be carried out simultaneously.
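
The pipeline outlined in the abstract (parallel agents extract metadata, a repository stores it, and an OAI-PMH layer serves it as XML) can be illustrated with a minimal sketch. It is not the authors' implementation. The resource list, the `extract_metadata`, `run_agents`, and `list_records_xml` functions, the use of a thread pool to stand in for agents, and the in-memory dictionary acting as the repository are all assumptions made for illustration. The XML is a simplified `ListRecords`-style page that omits elements a full OAI-PMH response requires, such as `responseDate`, `request`, and `datestamp`.

```python
from concurrent.futures import ThreadPoolExecutor
import xml.etree.ElementTree as ET

# Namespaces used by OAI-PMH 2.0 and unqualified Dublin Core.
OAI_NS = "http://www.openarchives.org/OAI/2.0/"
OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("oai", OAI_NS)
ET.register_namespace("oai_dc", OAI_DC_NS)
ET.register_namespace("dc", DC_NS)

# Hypothetical stand-ins for OAI resources (e.g. PDF documents).
RESOURCES = [
    {"id": "oai:example.org:1", "title": "Paper One", "creator": "A. Author"},
    {"id": "oai:example.org:2", "title": "Paper Two", "creator": "B. Author"},
]


def extract_metadata(resource):
    """Hypothetical agent task. A real agent would parse the document itself;
    here the fields are simply copied from the stand-in resource."""
    return {
        "identifier": resource["id"],
        "title": resource["title"],
        "creator": resource["creator"],
    }


def run_agents(resources, max_agents=4):
    """Run one extraction task per resource in parallel and return an
    in-memory repository keyed by record identifier (an assumption)."""
    with ThreadPoolExecutor(max_workers=max_agents) as pool:
        records = list(pool.map(extract_metadata, resources))
    return {rec["identifier"]: rec for rec in records}


def list_records_xml(repository):
    """Build a simplified ListRecords-style XML page from the repository."""
    root = ET.Element(f"{{{OAI_NS}}}OAI-PMH")
    listing = ET.SubElement(root, f"{{{OAI_NS}}}ListRecords")
    for identifier in sorted(repository):
        rec = repository[identifier]
        record = ET.SubElement(listing, f"{{{OAI_NS}}}record")
        header = ET.SubElement(record, f"{{{OAI_NS}}}header")
        ET.SubElement(header, f"{{{OAI_NS}}}identifier").text = identifier
        metadata = ET.SubElement(record, f"{{{OAI_NS}}}metadata")
        dc = ET.SubElement(metadata, f"{{{OAI_DC_NS}}}dc")
        ET.SubElement(dc, f"{{{DC_NS}}}title").text = rec["title"]
        ET.SubElement(dc, f"{{{DC_NS}}}creator").text = rec["creator"]
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    repository = run_agents(RESOURCES)
    print(list_records_xml(repository))
```

In this sketch the XML string returned by `list_records_xml` plays the role of the page that would be handed to a search engine's indexer. A thread pool was chosen only because it is the simplest way to show concurrent extraction; the paper's agents may be organized quite differently.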

Authors and Affiliations

Shruti Sharma, J. P. Gupta, A. K. Sharma

How To Cite

Shruti Sharma, J. P. Gupta, A. K. Sharma (2010). A Novel Architecture of Agent based Crawling for OAI Resources. International Journal on Computer Science and Engineering, 2(4), 1190-1195. https://europub.co.uk/articles/-A-91883