Advanced Annotation Creator for Search Results from Web Databases

Abstract

A large portion of the deep web is database based, i.e., for many search engines, data encoded in the returned result pages come from the underlying structured data-bases. Such type of search engines is often referred as Web databases (WDB). An increasing number of databases have become web accessible through HTML form-based search interfaces. The data units returned from the underlying database are usually encoded into the result pages dynamically for human browsing. For the encoded data units to be machine processable, which is essential for many applications such as deep web data collection and Internet comparison shopping, they need to be extracted out and assigned meaningful labels. In this paper, we present an automatic annotation approach that first aligns the data units on a result page into different groups such that the data in the same group have the same semantic. Then, for each group we annotate it from different aspects and aggregate the different annotations to predict a final annotation label for it. An annotation wrapper for the search site is automatically constructed and can be used to annotate new result pages from the same web database. Our experiments indicate that the proposed approach is highly effective. The application is designed using Microsoft Visual Studio .Net 2005 as front end. The coding language used is Visual C# .Net. MS-SQL Server 2000 is used as back end database.

Authors and Affiliations

Gayathri Thangavel, Menaka Chinnasamy

Keywords

Related Articles

A Measurement of Medium Range of Underground Coal Mine Using Wireless Sensor Network

In this paper can extend and intend to monitor subversive coal mine by using the transceiver ZigBee wireless network. It can detect three ways of sensor that is humidity sensor, temperature sensor, gas sensor. Ones it c...

IOT Based Smart Environmental Monitoring Using Arduino

This paper proposes an approach to build a cost effective standardized environmental monitoring device using the Arduino Board. The system was designed using Embedded C Programming language and can be controlled and acc...

Fault Diagnosis and Monitoring In Wind Turbine Using Can Bus

This paper is a CAN based architecture intended for the purpose of monitoring and fault diagnosis of wind turbine. CAN is a memo based protocol designed specifically for automotive, late aerospace, industrial automation...

Micro converter fed BLDC motor using PV applications

In this paper using a transformer less step up voltage method is used. Reduced the losses and improved the power quality by using this proposed method. Within the photovoltaic (PV) power-generation marketplace, the ac P...

Wireless Charging Techniques – A Survey

In recent years, there has been an enormous development in the field of wireless technologies. Wireless charging technologies uses Inductive power transfer, Magnetic resonance coupling transfer and RF radiation to charg...

Download PDF file
  • EP ID EP20856
  • DOI -
  • Views 246
  • Downloads 4

How To Cite

Gayathri Thangavel, Menaka Chinnasamy (2015). Advanced Annotation Creator for Search Results from Web Databases. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 3(5), -. https://europub.co.uk/articles/-A-20856