A SURVEY OF TOOLS FOR EXTRACTING AND ALIGNING THE DATA IN WEB

Abstract

The world-wide web is rapidly growing day by day in all fields, mining the data from multiple websites is necessary to filter the relevant contents. Although many approaches developed for extracting the data, there were some difficulties found when using such tools. In this paper, we survey web data extraction and alignment process in two dimensions: record extraction and alignment. The first dimension explains the extracting data records from multiple query result pages automatically. The second one measures similarity between the data records for aligning the records by pairwise and holistically and then nested structure processing. We believe these criteria enhance the performance measures to check existing data extraction methods.

Authors and Affiliations

SureshKumar. T , Sivaranjani. S , Dr. Shanthi. N

Keywords

Related Articles

Robust Face Recognition Using Artificial Neural Network

Face recognition is done naturally by humans. However, developing a computer algorithm to do the same thing is difficult. Assume for the moment we start with images, and we want to distinguish between images of different...

A Review on Tissue Segmentation and Feature Extraction of MRI Brain images

Magnetic resonance imaging (MRI) is an important diagnostic imaging technique for the early detection of brain cancer. Brain cancer is one of the most dangerous diseases occurring commonly among human beings. The chances...

Techniques of Software Fault Tolerance

Fault tolerance is the ability of a system to perform its function correctly even in the presence of internal faults. We should accept that, relying on software techniques for obtaining dependability means accepting some...

DETECTING AND BLOCKING OF SPAM ZOMBIE MECHANISM

A zombie is a computer connected to the Internet that has been compromised by a hacker, computer virus or Trojan horse and can be used to perform malicious tasks of one sort or another under remote direction. Botnets of...

Protein-Protein Interaction Classification Using Jordan Recurrent Neural Network

Proteins form a very important part of a living cell. The biological functions are carried out by the proteins within the cell by interacting with other proteins in other cells. This is called protein-protein interaction...

Download PDF file
  • EP ID EP142023
  • DOI -
  • Views 127
  • Downloads 0

How To Cite

SureshKumar. T, Sivaranjani. S, Dr. Shanthi. N (2014). A SURVEY OF TOOLS FOR EXTRACTING AND ALIGNING THE DATA IN WEB. International Journal of Computer Science & Engineering Technology, 5(3), 262-265. https://europub.co.uk/articles/-A-142023