A Novel Data Extraction and Alignment Method for Web Databases

Journal Title: International Journal of Modern Engineering Research (IJMER) - Year 2013, Vol 3, Issue 4

Abstract

 Online databases, also called web databases, comprise the deep web tag and value. Compared with WebPages in the surface web, which can be accessed by using a unique URL, pages in the deep web are dynamically generated in response to a user query submitted through the query interface of a web database. Upon receiving a user’s query, a web database returns the relevant data, either structured or semi structured, encoded in the HTML pages. Many web applications, such as data integration, met querying and comparison shopping, need the data from multiple web databases. For these applications to further utilize the data embedded in HTML pages, the automatic data extraction is necessary. Only when the data are extracted and organized in a structured manner, such as tables, can they be aggregated and compared. Hence, accurate data extraction is vital for these applications to perform process correctly. This paper focuses on the problem of automatically extracting data records that are encoded in the query result pages generated from web databases.

Authors and Affiliations

Sravan Kumar Teegala

Keywords

Related Articles

High Performance MAC Unit for FFT Implementation

In this paper we have proposed an efficient way of implementing a Fast Fourier Transform (FFT) processor using high performance pipelined Multiply and Accumulate (MAC) unit. The multiplication unit is implemented us...

Study of Flexible Lift Mechanism for Press Shop

The industrial sector is one of the important sectors of the Indian economy. The Small Scale Industries (SSI) sector is one of the most vital sectors of the Indian Economy in terms of employment generation, the strong en...

 An optimised multi value logic cell design with new architecture of many value logic gates

 Propose thesis work is a design of a Multi Logic Memory cell of four logic levels which can hold Logic 0, Logic 1, Logic 2 & Logic 3 and also propose an Interface module design between multi logic system with b...

Application of CNTFET as Logic Gates and its implementation using HSPICE

The steady reduction in the dimension of transistors, according to Moore's law has been the main force behind the regular leaps in the level of performance of the silicon ICs. Due to the effects like the short channel ef...

 Token Based Packet Loss Control Mechanism for Networks

 Modern IP network services provide for the simultaneous digital transmission of data, voice, and video. These services require congestion control algorithms and protocols which can solve the packet loss parameter...

Download PDF file
  • EP ID EP130910
  • DOI -
  • Views 103
  • Downloads 0

How To Cite

Sravan Kumar Teegala (2013).  A Novel Data Extraction and Alignment Method for Web Databases. International Journal of Modern Engineering Research (IJMER), 3(4), 2344-2346. https://europub.co.uk/articles/-A-130910