A Review of Various Techniques of Web Content Mining For HTML and XML Contents
Journal Title: International Journal of Research in Computer and Communication Technology - Year 2014, Vol 3, Issue 6
Abstract
The World Wide Web is the largest source of information. Most of the data on the web is dynamic and unstructured, which makes it increasingly difficult to retrieve relevant data. Data mining is the field of computer science concerned with extracting knowledge from very large amounts of data. Web mining is the application of data mining techniques to web data in order to extract useful knowledge from it. This paper presents an overview of various techniques that have been used for web content mining, covering images, audio, video, and semi-structured content such as HTML and XML. Because HTML has many limitations, such as a fixed set of tags, case insensitivity, and a design aimed only at displaying data, web developers have started to build web pages on emerging web technologies such as XML and Flash. XML was designed to describe data and to focus on what the data is. XML also plays the role of a metalanguage: it allows document authors to create customized markup languages for an unlimited variety of document types, which has made it a standard data format for online data exchange.
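The contrast between HTML and XML drawn above can be illustrated with a minimal sketch using only the Python standard library. The sample documents, the tag names (`catalog`, `book`, `title`, `price`), and the helper names `mine_xml` and `mine_html` are assumptions made for illustration and do not come from the paper. The XML side yields named fields directly, while the HTML side yields only display text whose meaning has to be inferred.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical sample documents, invented for this illustration.
XML_DOC = """<catalog>
  <book id="b1"><title>Web Mining</title><price>30</price></book>
  <book id="b2"><title>Data Mining</title><price>45</price></book>
</catalog>"""

HTML_DOC = "<html><body><h1>Books</h1><p>Web Mining <b>30</b></p></body></html>"


def mine_xml(doc):
    """Extract structured records; the custom tags say what each value is."""
    root = ET.fromstring(doc)
    return [
        {
            "id": book.get("id"),
            "title": book.findtext("title"),
            "price": float(book.findtext("price")),
        }
        for book in root.iter("book")
    ]


class TextExtractor(HTMLParser):
    """Collect visible text; HTML tags describe layout, not meaning."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append(text)


def mine_html(doc):
    parser = TextExtractor()
    parser.feed(doc)
    parser.close()
    return parser.chunks


if __name__ == "__main__":
    print(mine_xml(XML_DOC))
    print(mine_html(HTML_DOC))
```

In this sketch the XML records carry their own field names, whereas the HTML output is a flat list of strings that a content-mining technique would still need to interpret.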
Authors and Affiliations
Rupinder Kaur, Kamaljit Kaur