Comparison of Machine Learning Algorithms to Classify Web Pages

Abstract

The ‘World Wide Web’, or simply the web, represents one of the largest sources of information in the world. We can say that any topic we think about is probably finding it's on the web. Web information comes in different forms and types such as text documents, images and videos. However, extracting useful information, without the help of some web tools, is not an easy process. Here comes the role of web mining, which provides the tools that help us to extract useful knowledge from data on the internet. Many researchers focus on the issue of web pages classification technology that provides high accuracy. In this paper, several ‘supervised learning algorithms’ evaluation to determining the predefined categories among web documents. We use machine learning algorithms ‘Artificial Neural Networks (ANN)’, ‘Random Forest (RF)’, ‘AdaBoost’ to perform a behavior comparison on the web pages classifications problem.

Authors and Affiliations

Ansam A. AbdulHussien

Keywords

Related Articles

 ID Numbers Recognition by Local Similarity Voting

  This paper aims to recognize ID numbers from three types of valid identification documents in China: the first-generation ID card, the second-generation ID card and the driver license of motor vehicle. We hav...

User based Recommender Systems using Implicative Rating Measure

This paper proposes the implicative rating measure developed on the typicality measure. The paper also proposes a new recommendation model presenting the top N items to the active users. The proposed model is based on th...

Mining Opinion in Online Messages

The number of messages that can be mined from online entries increases as the number of online application users increases. In Malaysia, online messages are written in mixed languages known as ‘Bahasa Rojak’. Therefore,...

Evaluation of Peer Robot Communications using CryptoROS

The demand of cloud robotics makes data encryp-tion essential for peer robot communications. Certain types of data such as odometry, action controller and perception data need to be secured to prevent attacks. However, t...

Motion Blobs as a Feature for Detection on Smoke 

Disturbance that is caused due to visual perception with the atmosphere is coined as smoke, but the major problem is to quantify the detected smoke that is made up of small particles of carbonaceous matter in the air, re...

Download PDF file
  • EP ID EP240697
  • DOI 10.14569/IJACSA.2017.081127
  • Views 83
  • Downloads 0

How To Cite

Ansam A. AbdulHussien (2017). Comparison of Machine Learning Algorithms to Classify Web Pages. International Journal of Advanced Computer Science & Applications, 8(11), 205-209. https://europub.co.uk/articles/-A-240697