GENERAL WEB KNOWLEDGE MINING FRAMEWORK
Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 10
Abstract
Mining the web is defined as discovering knowledge from hypertext and World Wide Web. The World Wide Web is one of the longest rising areas of intelligence gathering. Now a day there are billions of web pages, HTML archive accessible via the internet, and the number is still increasing. However, considering the inspiring diversity of the web, retrieving of interestingness web based content has become a very complex task. The large amount of data heterogeneity, complex format, high dimensional data and lack of structure of web, knowledge mining is a challenging task. In this paper, it is proposed to introduce a new framework generated to handle unstructured complex data. This web knowledge mining expertise brings forward a kind of XML-based distributed data mining architecture. Based on the research of web knowledge mining, XML is used to create well structured data. Web knowledge mining framework attempts to determine useful knowledge from derived data, complex format, and high dimensional data obtained from the interactions of the users through the Web.
Authors and Affiliations
B. Madasamy , Dr. J. Jebmalar Tamilselvi
An Analysis of Particle Swarm Optimization with Data Clustering-Technique for Optimization in Data Mining
Data clustering is a popular approach for automatically finding classes, concepts, or groups of patterns. Clustering aims at representing large datasets by a fewer number of prototypes or clusters. It brings simplicity i...
Security Enhancement of Dynamic System Using PID Controller and Optimization Algorithm
This paper presents a research project on a dynamic system by using a controller known as PID Controller, used to provide the simplest and yet effective solutions to most of the control engineering applications today [4]...
An Efficient Agent-Based AODV Routing Protocol in MANET
A MANET (Mobile Adhoc Network) consists of a collection of mobile nodes communicating with each other without any fixed infrastructure such as access points or base stations. MANETS are self organizing or self restoring....
SOFTWARE ARCHITECTURE BASED REGRESSION TESTING
Software architecture plays a significant role in development of a dependable system. The purpose of regression testing is to make the system fault tolerant. The amalgamation of these two, results in the development of a...
Rebroadcasting for Routing Reduction based upon Neighbor coverage in Ad Hoc Networks
Cause of nodes high mobility in mobile ad hoc networks (MANETs), there are frequent link breakages exist which escort to frequent route discoveries and path failures. The route discovery procedure cannot be ignored. In a...