A Comparative Analysis of Classification Algorithms on Diverse Datasets
Journal Title: Engineering, Technology & Applied Science Research - Year 2018, Vol 8, Issue 2
Abstract
Data mining is the computational process of finding patterns in large data sets. Classification, one of the main domains of data mining, generalizes a known structure so that it can be applied to new data to predict its class. Many classification algorithms are used to classify different data sets. They are based on different methods, such as probability, decision trees, neural networks, nearest neighbors, Boolean and fuzzy logic, and kernel-based methods. In this paper, we apply three diverse classification algorithms to ten datasets. The datasets were selected based on their size and/or the number and nature of their attributes. Results are discussed using performance evaluation measures such as precision, accuracy, F-measure, Kappa statistic, mean absolute error, relative absolute error, and ROC area. The comparative analysis is carried out using accuracy, precision, and F-measure. We specify the features and limitations of the classification algorithms for datasets of diverse nature.
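As an illustration of the evaluation measures named in the abstract, the following is a minimal sketch of how accuracy, precision, F-measure, and the Kappa statistic can be computed from a binary confusion matrix. It is not the authors' code: the function name `evaluate` and the example counts are hypothetical, and the abstract does not state which tool or averaging scheme was used for multi-class datasets.

```python
def evaluate(tp, fp, fn, tn):
    """Compute common classification measures from binary confusion-matrix counts.

    tp, fp, fn, tn are hypothetical counts of true positives, false positives,
    false negatives, and true negatives (illustrative, not from the paper).
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    # Cohen's kappa: observed agreement corrected for agreement expected by chance.
    p_pos = ((tp + fp) / total) * ((tp + fn) / total)
    p_neg = ((fn + tn) / total) * ((fp + tn) / total)
    p_expected = p_pos + p_neg
    kappa = (accuracy - p_expected) / (1 - p_expected) if p_expected != 1 else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "kappa": kappa,
    }


if __name__ == "__main__":
    # Made-up counts, used only to show the call.
    for name, value in evaluate(tp=40, fp=10, fn=5, tn=45).items():
        print(f"{name}: {value:.4f}")
```

Kappa is reported alongside accuracy because it discounts the agreement a classifier would reach by chance, which matters when class sizes are unbalanced.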
Authors and Affiliations
M. Alghobiri