Deep Web Crawler: A Review

Abstract

In today’s scenario, there is an ample amount of data on the internet that can be accessed by everyone. This is the data that can be indexed by search engines. There are softwares named Web Crawlers that explore the WWW in an efficient manner. But there is also a large amount of data that is still out of reach from the access of the conventional search engines. This is known as Deep Web or Invisible Web. Web pages that are hidden created dynamically as a result of queries send to particular web databases. For traditional web crawlers, it is almost impossible to access the content of deep web due to its structure. To retrieve the contents of deep web is a challenge in itself. This paper discusses the methods and tools of crawling the web that is hidden beneath the surface.

Authors and Affiliations

Smita Agrawal, Kriti Agrawal

Keywords

Related Articles

In-Depth Energy Analysis and Consumption Prediction of India

Today's world requires extremely efficient energy consumption. Demand is rising as a result of the industrial sector's quick advancements, making energy efficiency initiatives essential to reducing energy waste and satis...

Effective Pattern Discovery for Text Mining Using Pattern Taxonomy Model

We describe an effective and innovative pattern discovery technique. In order to overcome the problem of misinterpretation and low frequency pattern taxonomy model is used. It makes use of closed sequential patterns and...

A Systematic Review of Challenges in Fog Computing

The number of Internet of Things (IoT) applications is rapidly increasing. Current cloud-centric IoT designs, on the other hand, are unable to meet the mobility and dormancy necessities of duration precarious IoT practic...

Smart Agriculture Using IoT

The Internet of Things (IOT) is transforming agricultural by incorporating farmers in a variety of approaches to tackle challenges in the field, like as precise and conservatism farming. Harvest web surveillance includes...

Use of Smart Intrusion Detection System for Enhancing the Security in Hierarchical Wireless Sensor Network

Trusted environment provides safety measures for the sensor network. There are many problems that occur during the management of resources. Memory management and computation overhead or CPU usage are the major issues. Se...

Download PDF file
  • EP ID EP744570
  • DOI -
  • Views 27
  • Downloads 0

How To Cite

Smita Agrawal, Kriti Agrawal (2013). Deep Web Crawler: A Review. International Journal of Innovative Research in Computer Science and Technology, 1(1), -. https://europub.co.uk/articles/-A-744570