Web Crawler Used in Search Engine

Abstract

The World Wide Web (WWW) is a collection of billions of documents formatted using HTML. Web Search engines are used to find the desired information on the World Wide Web. Whenever a user query is inputted, searching is performed through that database. The size of repository of search engine is not enough to accommodate every page available on the web. So it is desired that only the most relevant pages must be stored in the database. So, to store those most relevant pages from the World Wide Web, a better approach has to be followed. The software that traverses web for getting the relevant pages is called “Crawlers” or “Spiders”. A specialized crawler called focussed crawler traverses the web and selects the relevant pages to a defined topic rather than to explore all the regions of the web page. The crawler does not collect all the web pages, but retrieves only the relevant pages out of all. So the major problem is how to retrieve the relevant and quality web pages.

Authors and Affiliations

Harshali Kshirsagar, Pratibha Rewaskar, Komal Ramteke

Keywords

Related Articles

Design and Implementation of Reliable Solar Tree

Flat or roof top mountings of Photovoltaic (PV) structures require large location or land. Scarcity of land is greatest problem in towns or even in villages in India. Sun strength Tree presents higher opportunity to fla...

Image Quality Assessment for Multi Exposure Fused Images

Multi-exposure picture combination (MEF) is viewed as a successful quality improvement procedure generally received in buyer gadgets. In this paper, we do a subjective client study to assess the nature of pictures creat...

Privacy Preserving in Data Mining

Data mining is an increasingly important technology for extracting useful knowledge hidden in large collection of data. It is today well observe that database represent important role in many application and for this re...

Non Traditional Optimization Techniques For Cutting Force Optimization In Milling Process Based On Machining Parameters

Minimum cutting forces are always gives the better results on response parameters. In this paper describes the nontraditional optimization methods to get the optimum cutting force in milling process. The objective funct...

Cloud Computing For Rural India

Majority of Indian population lives in the villages and hence the future of India lies in the development of rural India. Cloud Computing is the revolution in computing domain where the cloud resources are made availabl...

Download PDF file
  • EP ID EP20109
  • DOI -
  • Views 220
  • Downloads 4

How To Cite

Harshali Kshirsagar, Pratibha Rewaskar, Komal Ramteke (2015). Web Crawler Used in Search Engine. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 3(4), -. https://europub.co.uk/articles/-A-20109