Advanced Annotation Creator for Search Results from Web Databases

Abstract

A large portion of the deep web is database based, i.e., for many search engines, the data encoded in the returned result pages come from underlying structured databases. Such search engines are often referred to as Web databases (WDBs). An increasing number of databases have become web accessible through HTML form-based search interfaces. The data units returned from the underlying database are usually encoded into the result pages dynamically for human browsing. For the encoded data units to be machine processable, which is essential for many applications such as deep web data collection and Internet comparison shopping, they need to be extracted and assigned meaningful labels. In this paper, we present an automatic annotation approach that first aligns the data units on a result page into different groups such that the data in the same group have the same semantics. Then, we annotate each group from different aspects and aggregate the different annotations to predict a final annotation label for it. An annotation wrapper for the search site is automatically constructed and can be used to annotate new result pages from the same web database. Our experiments indicate that the proposed approach is highly effective. The application is designed using Microsoft Visual Studio .Net 2005 as the front end. The coding language is Visual C# .Net, and MS-SQL Server 2000 is used as the back-end database.
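The align, annotate, and aggregate steps described in the abstract can be illustrated with a small sketch. This is not the authors' implementation (which the abstract says is written in Visual C# .Net); it is a minimal Python illustration under stated assumptions: alignment is reduced to grouping by column position, the two annotators (`annotate_by_prefix`, `annotate_by_format`) and their confidence values are hypothetical, and the aggregation rule (treating annotators as independent and combining their confidences as 1 minus the product of their failure probabilities) is one plausible choice rather than the rule confirmed by the source.

```python
from collections import defaultdict

def align(records):
    """Group data units by column position.

    Assumption: every record has the same number of data units, so the
    i-th unit of each record belongs to the same semantic group.
    """
    return [list(column) for column in zip(*records)]

def annotate_by_prefix(group):
    """Hypothetical annotator: use a shared 'Label: value' prefix as the label."""
    if not all(":" in unit for unit in group):
        return None
    prefixes = {unit.split(":", 1)[0].strip() for unit in group}
    if len(prefixes) == 1:
        return prefixes.pop().lower(), 0.9  # assumed confidence
    return None

def annotate_by_format(group):
    """Hypothetical annotator: guess a label from the value format."""
    values = [unit.split(":", 1)[-1].strip() for unit in group]
    if all(v.startswith("$") for v in values):
        return "price", 0.8  # assumed confidence
    if all(v.isdigit() and len(v) == 4 for v in values):
        return "year", 0.7  # assumed confidence
    return None

def combine(group, annotators):
    """Aggregate annotator votes per label, assuming independent annotators."""
    failure = defaultdict(lambda: 1.0)  # probability that all votes for a label are wrong
    for annotate in annotators:
        result = annotate(group)
        if result is not None:
            label, confidence = result
            failure[label] *= 1.0 - confidence
    if not failure:
        return None, 0.0  # no annotator could label this group
    scores = {label: 1.0 - f for label, f in failure.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]

# Made-up search result records, each a list of extracted data units.
records = [
    ["Data Mining Concepts", "Price: $45.00", "2006"],
    ["Web Data Management", "Price: $60.50", "2011"],
]
groups = align(records)
labels = [combine(g, [annotate_by_prefix, annotate_by_format]) for g in groups]
for group, (label, score) in zip(groups, labels):
    print(group, "->", label, round(score, 2))
```

In this sketch the second group receives the same label from both annotators, so its combined score is higher than either individual confidence, while the first group stays unlabeled because neither annotator recognizes it. A real system would also need a more robust alignment step, since result records on live pages rarely have identical layouts.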

Authors and Affiliations

Gayathri Thangavel, Menaka Chinnasamy


How To Cite

Gayathri Thangavel, Menaka Chinnasamy (2015). Advanced Annotation Creator for Search Results from Web Databases. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 3(5), -. https://europub.co.uk/articles/-A-20856