Reducing Human Effort: Web Data Mining, Learning a New Characteristics from Big Data
Journal Title: GRD Journal for Engineering - Year 2015, Vol 1, Issue 1
Abstract
This paper presents a Reducing Human Effort: Web Data Mining, Learning a New Characteristics from Big data, reducing human effort in extracting precise information from undetected Web sites. Our approach aims at automatically adapting the information extraction knowledge previously learned from a source Web site to a new undetected site, at the same time, discovering previously undetected attributes. There is a two kinds of text related evidences from the source Web site are considered. The first kind of evidences is obtained from the extraction pattern contained in the previously learned wrapper. The second kind of evidences is derived from the previously extracted or collected items. A generative model for the generation of the web site independent content information and the site dependent layout format of the text fragments related to attribute values contained in a Web page is designed to connect the insecurity involved. We have conducted extensive experiments from more than 50 real world Web sites in more than five different domains to demonstrate the effectiveness of our context.
Authors and Affiliations
Mr. M. Srinivasan, Dr. S. Koteeswaran
Intelligent Pothole Repair Vehicle
Identifying and repairing potholes on the roads is labour intensive and expensive. It typically requires three or four people to do the monotonous job in difficult environments. So this is a great opportunity for a type...
Longitudinal Velocity Distribution in Straight and Curved Open Channels: A Model Study
This paper presents the experimental investigation regarding longitudinal velocity distribution in straight and curved reaches of an open channel. Extensive data has been collected in the laboratory flume with straight a...
Tourism and Culture: Pioneers of Development Local Rural Resort, Hodka Village (Bhuj)
Village or rural tourism showcases the rural culture and brings economic benefits to the communities, received a major thrust under India’s 10th Five Year Plan and was accorded priority. Primary focus is given on the inf...
Web Based Online Examination System
An Online Examination System is a web software solution, which allows any institute or industry to set up, direct and manage examinations via an online environs. Some of the problems faced during manual examination syste...
An Efficient Extreme Learning Machine Based Intrusion Detection System
This paper presents an intrusion detection technique based on online sequential extreme learning machine. For performance evaluation, KDDCUP99 dataset is used. In this paper, we use three feature selection techniques – f...