Reducing Human Effort: Web Data Mining, Learning a New Characteristics from Big Data
Journal Title: GRD Journal for Engineering - Year 2015, Vol 1, Issue 1
Abstract
This paper presents Reducing Human Effort: Web Data Mining, Learning a New Characteristics from Big Data, an approach that reduces the human effort needed to extract precise information from previously unseen Web sites. Our approach aims to automatically adapt the information extraction knowledge previously learned from a source Web site to a new, unseen site while, at the same time, discovering previously undetected attributes. Two kinds of text-related evidence from the source Web site are considered. The first kind is obtained from the extraction pattern contained in the previously learned wrapper. The second kind is derived from the previously extracted or collected items. To handle the uncertainty involved, we design a generative model for the generation of the site-independent content information and the site-dependent layout format of the text fragments related to attribute values contained in a Web page. We have conducted extensive experiments on more than 50 real-world Web sites in more than five different domains to demonstrate the effectiveness of our approach.
Authors and Affiliations
Mr. M. Srinivasan, Dr. S. Koteeswaran
Emergency Vehicle Priority System For Smart Cities
With the steady increase in metro-city population, the number of automobiles is increasing rapidly and metro traffic is growing more crowded, which leads to traffic jams. The proposed system will have an effective role...
Generation of Bio-Fuel (Bio- Briquettes) from Agricultural Waste: A Review
Waste is an unavoidable by-product of most human activities, and it is increasing day by day with economic development and rising living standards. The principal sources of solid waste are residential households and ag...
Automatic Sorting in Process Industries using PLC
Sorting is an important process in which items or products are differentiated based on their size, height and color. In order to sort items, we need to be able to compare them, i.e., to determine whether the object...
Using Fly Ash and Laterite as a Filtering Media
The present study reports a comparative study of a soil-based constructed soil filter system, monitored for about 2 months for the removal of turbidity from turbid water, in which fly ash and laterite are used as filtering media....
To Study Pedestrian Safety at Undesignated Urban Midblock Section by User’s Perception
Due to a lack of pedestrian walking and crossing facilities, pedestrians mostly use regular traffic lanes. The continuous increase in motor vehicles increases the chances of collision with pedestrians. In such a scenario, pedestrian...