Improvement of performance of web crawlers for efficient web searching and crawling

Abstract

The effectiveness of a crawler directly affects the efficiency of the searching quality of the web search engines. As the crawler interacts with billions of hosts or servers over a period of weeks or months, the issues of validity, flexibility and manageability are of major importance. Also crawler could retrieve some other information, which may be of unimportant to the search from the HTML files as it is parsing them to get the new URLs. In this paper, an attempt has been made to improve the performance of the web crawler by analyzing certain features of several algorithms such as best-first, breadth-first, pagerank, shark search and HITS. For this, various performance parameters such as precision, recall, accuracy and F-Score are taken into consideration. Based on the output parameters, an analysis is made for the improvement of web crawler towards web searching.

Authors and Affiliations

Dr. P. Jaganathan , S. Jaiganesh , P. Babu

Keywords

Related Articles

An Improved Montgomery’s Method Over Public-Key Cryptosystem

This paper deals with improving Montgomery’s algorithm. We improve mongomery’s algorithm such that modular multiplications can be executed two times faster. Each iteration in our algorithm requires only one addition, whi...

Performance Evolution and Modeling of Vapor Absorption System Using Flat Plate Collector

This paper presents to evaluate the characteristics and performance of vapour absorption refrigeration system using single stage lithium bromide – water (LiBr – H2O) as absorbent and refrigerant. The all parameters of re...

Performance analysis of AODV &GPSR routing protocol in VANET

VANET (Vehicular Ad Hoc Network) is an emerging technology to achieve intelligent inter vehicle communications, it is the specialized derivation of pure multi hop ad hoc networking and are already going through industria...

PEER TO PEER ASSOCIATION IN CONTENT DISTRIBUTION NETWORK

The troublesome issue of method associated implementing an economical law for load reconciliation in Content Delivery Networks (CDNs). We have a tendency to tend to base our proposal on a correct study of a CDN system, d...

Smart Green House Automation

Smart Green House Automation is a complete system to monitor and control the environment parameters inside a green house .It is necessary to design a control system to monitor various parameters like Temperature, Humidit...

Download PDF file
  • EP ID EP109334
  • DOI -
  • Views 116
  • Downloads 0

How To Cite

Dr. P. Jaganathan, S. Jaiganesh, P. Babu (2013). Improvement of performance of web crawlers for efficient web searching and crawling. International Journal of Computer Science & Engineering Technology, 4(4), 311-318. https://europub.co.uk/articles/-A-109334