Comparison of Machine Learning Algorithms to Classify Web Pages
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2017, Vol 8, Issue 11
Abstract
The ‘World Wide Web’, or simply the web, represents one of the largest sources of information in the world. We can say that any topic we think about is probably finding it's on the web. Web information comes in different forms and types such as text documents, images and videos. However, extracting useful information, without the help of some web tools, is not an easy process. Here comes the role of web mining, which provides the tools that help us to extract useful knowledge from data on the internet. Many researchers focus on the issue of web pages classification technology that provides high accuracy. In this paper, several ‘supervised learning algorithms’ evaluation to determining the predefined categories among web documents. We use machine learning algorithms ‘Artificial Neural Networks (ANN)’, ‘Random Forest (RF)’, ‘AdaBoost’ to perform a behavior comparison on the web pages classifications problem.
Authors and Affiliations
Ansam A. AbdulHussien
ID Numbers Recognition by Local Similarity Voting
This paper aims to recognize ID numbers from three types of valid identification documents in China: the first-generation ID card, the second-generation ID card and the driver license of motor vehicle. We hav...
User based Recommender Systems using Implicative Rating Measure
This paper proposes the implicative rating measure developed on the typicality measure. The paper also proposes a new recommendation model presenting the top N items to the active users. The proposed model is based on th...
Mining Opinion in Online Messages
The number of messages that can be mined from online entries increases as the number of online application users increases. In Malaysia, online messages are written in mixed languages known as ‘Bahasa Rojak’. Therefore,...
Evaluation of Peer Robot Communications using CryptoROS
The demand of cloud robotics makes data encryp-tion essential for peer robot communications. Certain types of data such as odometry, action controller and perception data need to be secured to prevent attacks. However, t...
Motion Blobs as a Feature for Detection on Smoke
Disturbance that is caused due to visual perception with the atmosphere is coined as smoke, but the major problem is to quantify the detected smoke that is made up of small particles of carbonaceous matter in the air, re...