Reliable Algorithm for Extracting Web Data
Journal Title: International Journal of Research in Computer and Communication Technology - Year 2013, Vol 2, Issue 1
Abstract
Web usage mining is the process of extracting useful information from server logs, i.e. users' history, in order to find out what users are looking for on the Internet. Some users might be looking only at textual data, whereas others might be interested in multimedia data. One could retrieve the data by copying it and pasting it into the relevant document, but this is tedious, time-consuming, and difficult when there is a large amount of data to retrieve. Extracting structured data from a web page is a challenging problem because page structures are complicated. Previous approaches depend on the web page programming language, and their main problem is the need to analyze the HTML source code. They must also take into account scripts such as JavaScript and Cascading Style Sheets in the HTML files, which makes it difficult for existing solutions to infer the regularity of the structure of web pages by analyzing the tag structures alone. To overcome this problem, we use the VIPS (Vision-based Page Segmentation) algorithm, which is independent of the page's programming language. This approach primarily uses the visual features of the web page to implement web data extraction.
Authors and Affiliations
R. V. V. Satyanarayana, Mortha Chinnarao, Sudhir Varma Raju, B. N. Jagadesh