A SURVEY OF TOOLS FOR EXTRACTING AND ALIGNING THE DATA IN WEB

Abstract

The world-wide web is rapidly growing day by day in all fields, mining the data from multiple websites is necessary to filter the relevant contents. Although many approaches developed for extracting the data, there were some difficulties found when using such tools. In this paper, we survey web data extraction and alignment process in two dimensions: record extraction and alignment. The first dimension explains the extracting data records from multiple query result pages automatically. The second one measures similarity between the data records for aligning the records by pairwise and holistically and then nested structure processing. We believe these criteria enhance the performance measures to check existing data extraction methods.

Authors and Affiliations

SureshKumar. T , Sivaranjani. S , Dr. Shanthi. N

Keywords

Related Articles

Quantum computation and Schizophrenia

Microtubules in the brain have been associated with quantum computation and consciousness. The microtubules have been reorted to be affected in schizophrenia. Schizophrenia is also associated with hallucinations (auditor...

EFS: Enhanced FACES Protocol for Secure Routing In MANET

Mobile Ad-hoc Network (MANET) is an autonomous system of mobile hosts equipped with wireless communication devices. These mobile nodes can form a network anywhere and at anytime. But the topology of the network thereby f...

A Literature Review: Cryptography Algorithms for Wireless sensor networks

Cryptography is that the observe and study of techniques for secure communication within the presence of third parties. It additionally plays important of wireless sensor networks. The cryptography drawback has addressed...

HYBRID PERSONALIZED RECOMMENDATION APPROACH FOR IMPROVING MOBILE E-COMMERCE

In recent years, the massive influx of information onto internet has facilitated user, not only retrieving information, but also discovering facts. However, web users usually suffer from the information overload problem...

COMPREHENSIVE STUDY AND COMPARISON OF INFORMATION DISPERSAL TECHNIQUES FOR CLOUD COMPUTING

Cloud systems refer to the collection of interconnected servers that are provisioned dynamically on demand, for execution of applications, to the customer like electricity grid. Cloud computing has gained great attention...

Download PDF file
  • EP ID EP142023
  • DOI -
  • Views 143
  • Downloads 0

How To Cite

SureshKumar. T, Sivaranjani. S, Dr. Shanthi. N (2014). A SURVEY OF TOOLS FOR EXTRACTING AND ALIGNING THE DATA IN WEB. International Journal of Computer Science & Engineering Technology, 5(3), 262-265. https://europub.co.uk/articles/-A-142023