A Comparative Analysis of Classification Algorithms on Diverse Datasets

Journal Title: Engineering, Technology & Applied Science Research - Year 2018, Vol 8, Issue 2

Abstract

Data mining is the computational process of finding patterns in large data sets. Classification, one of the main domains of data mining, generalizes a known structure so that it can be applied to new data to predict its class. Many classification algorithms are in use, and they are based on different methods such as probability, decision trees, neural networks, nearest neighbors, Boolean and fuzzy logic, and kernels. In this paper, we apply three diverse classification algorithms to ten datasets. The datasets were selected based on their size and/or the number and nature of their attributes. Results are discussed using performance evaluation measures such as precision, accuracy, F-measure, Kappa statistic, mean absolute error, relative absolute error, and ROC area. The comparative analysis is carried out using accuracy, precision, and F-measure. We identify the features and limitations of the classification algorithms for datasets of diverse nature.
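To make the evaluation measures named in the abstract concrete, the following is a minimal sketch of how accuracy, precision, recall, F-measure, and the Kappa statistic can be derived from the confusion counts of a two-class problem. It is not taken from the paper: the function name `classification_metrics` and the example counts are hypothetical and chosen only for illustration.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute common evaluation measures from binary confusion counts.

    tp, fp, fn, tn are the numbers of true positives, false positives,
    false negatives, and true negatives (hypothetical inputs).
    """
    total = tp + fp + fn + tn
    if total == 0:
        raise ValueError("confusion counts must not all be zero")

    # Accuracy: fraction of all instances classified correctly.
    accuracy = (tp + tn) / total

    # Precision: fraction of predicted positives that are truly positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0

    # Recall: fraction of actual positives that were found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0

    # F-measure: harmonic mean of precision and recall.
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0

    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_yes = ((tp + fp) / total) * ((tp + fn) / total)
    p_no = ((fn + tn) / total) * ((fp + tn) / total)
    p_expected = p_yes + p_no
    kappa = (accuracy - p_expected) / (1 - p_expected) if p_expected != 1 else 0.0

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "kappa": kappa,
    }


# Illustrative counts only; these are not results from the paper.
metrics = classification_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

The sketch covers the two-class case only. For multi-class datasets, precision and F-measure are usually computed per class and then averaged, and the averaging scheme is a further choice that the abstract does not specify.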

Authors and Affiliations

M. Alghobiri

  • EP ID EP168419

How To Cite

M. Alghobiri (2018). A Comparative Analysis of Classification Algorithms on Diverse Datasets. Engineering, Technology & Applied Science Research, 8(2), -. https://europub.co.uk/articles/-A-168419