Reducing Human Effort: Web Data Mining, Learning a New Characteristics from Big Data

Journal Title: GRD Journal for Engineering - Year 2015, Vol 1, Issue 1

Abstract

This paper presents an approach to reducing the human effort needed to extract precise information from previously unseen Web sites. The approach automatically adapts the information extraction knowledge learned from a source Web site to a new, unseen site and, at the same time, discovers previously undetected attributes. Two kinds of text-related evidence from the source Web site are considered. The first is obtained from the extraction patterns contained in the previously learned wrapper. The second is derived from the previously extracted or collected items. A generative model is designed to handle the uncertainty involved; it describes how the site-independent content information and the site-dependent layout format of the text fragments related to attribute values in a Web page are generated. We conducted extensive experiments on more than 50 real-world Web sites in more than five different domains to demonstrate the effectiveness of our approach.
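The idea of combining the two kinds of evidence can be illustrated with a minimal sketch. It is not the generative model described in the abstract: it swaps in a simple token-overlap (Jaccard) heuristic, and every name in it (`score_fragment`, `wrapper_cues`, `known_values`, the `weight` parameter, and the sample data) is a hypothetical chosen for illustration only.

```python
import re


def tokens(text):
    """Return the set of lowercase word/number tokens in a text fragment."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard overlap of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def score_fragment(fragment, context, wrapper_cues, known_values, weight=0.5):
    """Score a candidate fragment from a new site (illustrative heuristic).

    pattern_score: overlap between the text around the fragment and cue
                   tokens taken from the previously learned wrapper.
    content_score: best overlap between the fragment itself and any
                   previously extracted item from the source site.
    """
    pattern_score = jaccard(tokens(context), wrapper_cues)
    content_score = max(
        (jaccard(tokens(fragment), tokens(v)) for v in known_values),
        default=0.0,
    )
    return weight * pattern_score + (1 - weight) * content_score


# Hypothetical evidence carried over from the source site.
wrapper_cues = {"author"}                    # first kind: wrapper pattern cues
known_values = ["Jane Austen", "Mark Twain"]  # second kind: extracted items

# Hypothetical (fragment, surrounding context) pairs from the new site.
candidates = [("Jane Austen", "Author"), ("Add to cart", "Buy now")]

best = max(
    candidates,
    key=lambda c: score_fragment(c[0], c[1], wrapper_cues, known_values),
)
print(best[0])  # prints "Jane Austen"
```

A weighted sum is used here only because it is easy to follow; a probabilistic model such as the one the abstract describes would instead estimate how likely each fragment is to be an attribute value under both sources of evidence.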

Authors and Affiliations

Mr. M. Srinivasan, Dr. S. Koteeswaran

How To Cite

Mr. M. Srinivasan, Dr. S. Koteeswaran (2015). Reducing Human Effort: Web Data Mining, Learning a New Characteristics from Big Data. GRD Journal for Engineering, 1(1), 13-19. https://europub.co.uk/articles/-A-216626