Web Crawler Used in Search Engine

Abstract

The World Wide Web (WWW) is a collection of billions of documents formatted using HTML. Web Search engines are used to find the desired information on the World Wide Web. Whenever a user query is inputted, searching is performed through that database. The size of repository of search engine is not enough to accommodate every page available on the web. So it is desired that only the most relevant pages must be stored in the database. So, to store those most relevant pages from the World Wide Web, a better approach has to be followed. The software that traverses web for getting the relevant pages is called “Crawlers” or “Spiders”. A specialized crawler called focussed crawler traverses the web and selects the relevant pages to a defined topic rather than to explore all the regions of the web page. The crawler does not collect all the web pages, but retrieves only the relevant pages out of all. So the major problem is how to retrieve the relevant and quality web pages.

Authors and Affiliations

Harshali Kshirsagar, Pratibha Rewaskar, Komal Ramteke

Keywords

Related Articles

A Rate Allocation Algorithm in Uplink SC-FDMA Systems for Enhancing Network Throughput

For a cellular system where mobile terminals transmit in the uplink to base stations (BSs) using single carrier- frequency division multiple access (SC-FDMA), we consider mul- ticell processing among BSs. Received signa...

Automated Radionics Device for Health-Care Solution in Global Area Networks.

This paper shows a archetype of Automatic Machine for perfect health aided solution that adjoins mobile and Internet Protocol Version 6 and approaches in a Radionics sensor network to analyze the health condition of pat...

A Matlab Based Fault Analysis of TL

Nowadays the demand of electricity or power are increases day by day this result to transmit more power by growing the transmission line capacity from one place to the other place. But during the transmission some fault...

Design of a Parallel Multi-Threaded Programming Model for Multicore Architecture with Resource Sharing

Multi-core architectures have become main stream, and multi-core processors are found in products ranging from small portable cell phones to large computer servers. In parallel, research on real-time systems has mainly...

Review on Different PAN Sharpening Methods

In current technological development in electronics era have promoted real time remote sensing system to collect fine image resolution over the ground. The earth observation satellite such as Quick-bird, IKONOS, landsat...

Download PDF file
  • EP ID EP20109
  • DOI -
  • Views 222
  • Downloads 4

How To Cite

Harshali Kshirsagar, Pratibha Rewaskar, Komal Ramteke (2015). Web Crawler Used in Search Engine. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 3(4), -. https://europub.co.uk/articles/-A-20109