A New Hidden Web Crawling Approach

Abstract

Traditional search engines deal with the Surface Web which is a set of Web pages directly accessible through hyperlinks and ignores a large part of the Web called hidden Web which is a great amount of valuable information of online database which is “hidden” behind the query forms. To access to those information the crawler have to fill the forms with a valid data, for this reason we propose a new approach which use SQLI technique in order to find the most promising keywords of a specific domain for automatic form submission. The effectiveness of proposed framework has been evaluated through experiments using real web sites and encouraging preliminary results were obtained

Authors and Affiliations

L. Saoudi , A. Boukerram , S. Mhamedi

Keywords

Related Articles

A Fuzzy based Model for Effort Estimation in Scrum Projects

This paper aims to utilize the fuzzy logic concepts to improve the effort estimation in Scrum framework and in turn add a significant enhancement to Scrum. Scrum framework is one of the most popular agile methods in whic...

Differential Evolution based SHEPWM for Seven-Level Inverter with Non-Equal DC Sources

This paper presents the application of differential evolution algorithm to obtain optimal switching angles for a single-phase seven-level to improve AC voltage quality. The proposed inverter in this article is composed o...

Design of A high performance low-power consumption discrete time Second order Sigma-Delta modulator used for Analog to Digital Converter

This paper presents the design and simulations results of a switched-capacitor discrete time Second order Sigma-Delta modulator used for a resolution of 14 bits Sigma-Delta analog to digital converter. The use of operati...

Design of Linear Phase High Pass FIR Filter using Weight Improved Particle Swarm Optimization

The design of Finite Impulse Response (FIR) digital filter involves multi-parameter optimization, while the traditional gradient-based methods are not effective enough for precise design. The aim of this paper is to pres...

A Leveled Dag Critical Task Firstschedule Algorithm in Distributed Computing Systems

In distributed computing environment, efficient task scheduling is essential to obtain high performance. A vital role of designing and development of task scheduling algorithms is to achieve better makes pan. Several tas...

Download PDF file
  • EP ID EP106530
  • DOI 10.14569/IJACSA.2015.061039
  • Views 98
  • Downloads 0

How To Cite

L. Saoudi, A. Boukerram, S. Mhamedi (2015). A New Hidden Web Crawling Approach. International Journal of Advanced Computer Science & Applications, 6(10), 293-297. https://europub.co.uk/articles/-A-106530