An Overview of Web Content Mining Tools
Journal Title: Bonfring International Journal of Data Mining - Year 2016, Vol 6, Issue 1
Abstract
The Web is one of the most widespread platforms for information exchange today, because publishing documents on it is easy. As the number of users and providers increases, the number of documents grows, and searching for information becomes a difficult and time-consuming process. Web mining uses various data mining techniques to discover useful knowledge from Web hyperlinks, page content, and usage log files. Mining tools scan HTML documents, images, and text, and the results are provided to search engines. This can help search engines return productive results for each search in order of relevance. In this paper, we give a brief introduction to the concepts related to data mining and web mining, followed by an overview of different Web mining tools. We conclude by presenting a comparative table of these tools based on some pertinent criteria.
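As a rough illustration of what scanning an HTML document for hyperlinks, images, and text can involve, the following minimal sketch uses only Python's standard `html.parser` module. It is not one of the tools surveyed in the paper; the names `PageScanner` and `scan_page` are hypothetical and chosen here for illustration only.

```python
from html.parser import HTMLParser


class PageScanner(HTMLParser):
    """Collects hyperlinks, image sources, and visible text from one HTML page.

    Hypothetical helper for illustration; not a tool from the paper.
    """

    def __init__(self):
        super().__init__()
        self.links = []    # href values of <a> tags (structure mining input)
        self.images = []   # src values of <img> tags
        self.text = []     # visible text fragments (content mining input)
        self._skip = 0     # nesting depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])
        elif tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        # Ignore script/style bodies and whitespace-only fragments.
        if not self._skip and data.strip():
            self.text.append(data.strip())


def scan_page(html):
    """Return the links, images, and text found in an HTML string."""
    scanner = PageScanner()
    scanner.feed(html)
    scanner.close()
    return {
        "links": scanner.links,
        "images": scanner.images,
        "text": " ".join(scanner.text),
    }
```

Calling `scan_page` on a page's HTML source returns a dictionary whose `links` entry could feed link-based analysis and whose `text` entry could feed content-based analysis. Usage log mining works on server logs rather than page markup and is outside this sketch.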
Authors and Affiliations
Dr Eldhose T John, Bibu Skaria, P. X. Shajan