I-ViDE: An Improved Vision-Based Approach for Deep Web Data Extraction

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 4

Abstract

 Abstract: Deep Web contents are accessed by queries submitted to Web databases and the returned data records are enwrapped in dynamically generated Web pages (they will be called deep Web pages in this paper). Extracting structured data from deep Web pages is a challenging problem due to the underlying intricate structures of such pages. Until now, a large number of techniques have been proposed to address this problem, but all of them have inherent limitations because they are HTML language dependent .Visual features are not taken into consideration. All previous methods are mostly dependent on table tags. A Vision based approach for web data extraction has overcome the limitations of previous work by utilizing some interesting common visual features on the web page. But still this approach has one drawback that it can process web page containing only one data region. Due to processing of one data region it reduces the precision and recall rate. As precision give us the rate that how many correct data records are extracted from relevant data records and recall give us the rate that how many relevant data records are extracted from overall data records. The proposed Improved-ViDE approach handles multi data-region in deep web pages which can improve the precision rate and recall rate.

Authors and Affiliations

Mrudula Varade , Vimla Jethani

Keywords

Related Articles

New Technique For Preventing SQL Injection Attack Based On Normal Use Model

Online applications have turned into an essential piece of our day by day lives, yet in the meantime, security of digital data put away in the web databases has been a developing concern. SQL injection attacks have been...

Analysis of Development Factors for Asian Countries using DWM on Big Data

In today’s world, most of the developing countries are rising to become a developed country. There have been analysis of countries that experienced banking crisis in the past. However, the analysis included only data pre...

 A classification of methods for frequent pattern mining

 Abstract: Data mining refers to extracting knowledge from large amounts of data. Frequent pattern mining is aheavily researched area in the field of data mining with wide range of applications. Frequent itemsets is...

 Organizational Strategies and Social Interaction Influence in Software Development Effort Estimation

 Abstract: In software development cost estimation, effort allocation is an important and usually challenging task for project management. This paper observes the use of concepts in software effort estimation by ana...

 An Automated Approach for Job Scheduling and Work Flow Mining

 Abstract: Now a day’s work allotment in a software firm become more important and cumbersome. The main objective of this concept is to reduce the work of the software developers in a software company and work alloc...

Download PDF file
  • EP ID EP94389
  • DOI 10.9790/0661-16440922
  • Views 121
  • Downloads 0

How To Cite

Mrudula Varade, Vimla Jethani (2014).  I-ViDE: An Improved Vision-Based Approach for Deep Web Data Extraction. IOSR Journals (IOSR Journal of Computer Engineering), 16(4), 9-22. https://europub.co.uk/articles/-A-94389