A Novel Architecture of Agent based Crawling for OAI Resources - Europub

Search

Apply

A Novel Architecture of Agent based Crawling for OAI Resources

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 4

Abstract

Nowadays, most of the search engines are competing to index as much of the Surface Web as possible with leaving a lurch at the OAI content (pdf documents), which holds a huge amount of information than surface web. In this paper, a novel framework for OAI-PMH based Crawler is being proposed that uses agents to extract the metadata about the OAI resources nd store them in a repository which is later on queried hrough he OAI-PMH layer to generate the XML pages ontaining the metadata. These pages are further added to the search gines repository for indexing that makes in turn increases the relevancy of Search Engine. Agents are being used to rallelize the whole process so that metadata extraction from multiple resources can be carried out simultaneously.

Authors and Affiliations

Shruti Sharma , J. P. Gupta , A. K. Sharma

Keywords

OAI-PMH; Agents; Surface web; Hidden Web

Related Articles

GRID SCHEDULING USING ENHANCED PSO ALGORITHM

Grid computing is a high performance computing environment to solve larger scale computational demands. Grid computing contains resource management, task scheduling, security problems, information management and so on. T...

Automatic Clustering Approaches Based On Initial Seed Points

Since clustering is applied in many fields, a number of clustering techniques and algorithms have been proposed and are available in the literature. This paper proposes a novel approach to address the major problems in a...

Automated Load Shedding Period Control System (An effective way to reduce human effort)

Energy is the basic necessity for the economic development of a country. Many functions necessary to present-day living grind to halt when the supply of energy stops. It is practically impossible to estimate the actual m...

A Heuristic Approach to the Disease Diagnose System Using Machine Learning Algorithms

Abstract--The paper deals with the concepts of expert system and data mining belongs to the Artificial Intelligence fields. The main task of expert system is to ratiocination, while the machine learning algorithm is to f...

Content Aware Media Retargeting for still images using Seam Carving

When changing height and width of image traditional techniques for image resizing are oblivious to the content of image. A simple operator seam carving is used for image and video retargeting. This seam carving operator...

Download PDF file

EP ID EP91883
DOI -
Views 123
Downloads 0