Comparison of Machine Learning Algorithms to Classify Web Pages

Abstract

The ‘World Wide Web’, or simply the web, represents one of the largest sources of information in the world. We can say that any topic we think about is probably finding it's on the web. Web information comes in different forms and types such as text documents, images and videos. However, extracting useful information, without the help of some web tools, is not an easy process. Here comes the role of web mining, which provides the tools that help us to extract useful knowledge from data on the internet. Many researchers focus on the issue of web pages classification technology that provides high accuracy. In this paper, several ‘supervised learning algorithms’ evaluation to determining the predefined categories among web documents. We use machine learning algorithms ‘Artificial Neural Networks (ANN)’, ‘Random Forest (RF)’, ‘AdaBoost’ to perform a behavior comparison on the web pages classifications problem.

Authors and Affiliations

Ansam A. AbdulHussien

Keywords

Related Articles

Improved Tracking Using a Hybrid Optcial-Haptic Three-Dimensional Tracking System

The aim of this paper is to asses to what extent an optical tracking system (OTS) used for position tracking in virtual reality can be improved by combining it with a human scale haptic device named Scalable-SPIDAR. The...

Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition

In this paper, we propose a novel architecture of wavelet network called Multi-input Multi-output Wavelet Network MIMOWN as a generalization of the old architecture of wavelet network. This newel prototype was applied to...

A Review on the Verification Approaches and Tools used to Verify the Correctness of Security Algorithms and Protocols

Security algorithms and protocols are typical essential upgrades that must be involved within systems and their structures to provide the best performance. The protocols and systems should go through verification and tes...

Middleware to integrate heterogeneous Learning Management Systems and initial results

The use of the Learning Management Systems (LMS) has been increased. It is desirable to access multiple learning objects that are managed by Learning Management Systems. The diversity of LMS allow us to consider them as...

Suitable Personality Traits for Learning Programming Subjects: A Rough-Fuzzy Model

Programming is a cognitive activity which requires logical reasoning to code for abstract presentation. This study aims to find out the personality traits of students who maintain the effective grades in learning program...

Download PDF file
  • EP ID EP240697
  • DOI 10.14569/IJACSA.2017.081127
  • Views 90
  • Downloads 0

How To Cite

Ansam A. AbdulHussien (2017). Comparison of Machine Learning Algorithms to Classify Web Pages. International Journal of Advanced Computer Science & Applications, 8(11), 205-209. https://europub.co.uk/articles/-A-240697