Improvement in Classification Algorithms through Model Stacking with the Consideration of their Correlation

Abstract

In this research, we analyzed the accuracy of several well-known classification algorithms and proposed a methodology for model stacking, based on the correlation among the algorithms, that improves their accuracy. We selected Support Vector Machines (svm), Naïve Bayes (nb), k-Nearest Neighbors (knn), Generalized Linear Model (glm), Linear Discriminant Analysis (lda), Gradient Boosting Machine (gbm), Recursive Partitioning and Regression Trees (rpart), Regularized Discriminant Analysis (rda), Neural Networks (nnet), and Conditional Inference Trees (ctree), and performed analyses on four textual datasets of different sizes: Scopus with 50,000 instances, IMDB Movie Reviews with 10,000 instances, Amazon Products Reviews with 1,000 instances, and Yelp with 1,000 instances. We used RStudio to perform the experiments. The results show that the performance of all algorithms increased at the meta level. Neural Networks achieved the best results, with more than 25% improvement at the meta level, and outperformed the other evaluated methods with an accuracy of 95.66%. Overall, the stacked model gives far better results than the individual algorithms.
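The abstract does not spell out how correlation enters the stacking step, so the following is only a minimal Python sketch of one plausible reading: drop base models whose predictions are highly correlated with a model already kept, then fit a meta-level learner on the remaining models' predictions. The 0.75 threshold, the greedy selection order, the lookup-table meta-learner, and the toy predictions are all assumptions made for illustration; they are not taken from the paper, which ran its experiments in R.

```python
from collections import Counter, defaultdict
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def select_low_correlation(preds, threshold=0.75):
    """Greedily keep models whose predictions correlate below `threshold`
    (an assumed cutoff) with every model already kept."""
    kept = []
    for name, p in preds.items():
        if all(abs(pearson(p, preds[k])) < threshold for k in kept):
            kept.append(name)
    return kept


def fit_meta(preds, names, y):
    """Toy meta-learner: map each pattern of base predictions to the
    most frequent true label observed for that pattern."""
    votes = defaultdict(Counter)
    for i, label in enumerate(y):
        pattern = tuple(preds[m][i] for m in names)
        votes[pattern][label] += 1
    return {pattern: c.most_common(1)[0][0] for pattern, c in votes.items()}


def predict_meta(table, pattern):
    """Use the learned table; fall back to a majority vote of the base
    predictions for a pattern never seen during fitting."""
    if pattern in table:
        return table[pattern]
    return Counter(pattern).most_common(1)[0][0]


# Hypothetical 0/1 predictions from four base models on eight instances.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
base_preds = {
    "svm":   [1, 0, 1, 0, 0, 0, 1, 1],
    "knn":   [1, 0, 1, 0, 0, 0, 1, 1],  # identical to svm, so redundant
    "nb":    [1, 1, 1, 1, 0, 0, 0, 0],
    "rpart": [0, 0, 1, 1, 0, 1, 1, 0],
}

kept = select_low_correlation(base_preds)
meta = fit_meta(base_preds, kept, y_true)
stacked = [predict_meta(meta, tuple(base_preds[m][i] for m in kept))
           for i in range(len(y_true))]
print("kept models:", kept)
print("stacked predictions:", stacked)
```

In this toy the meta-learner is fit and evaluated on the same eight instances, which only demonstrates the mechanics. A real experiment would fit the meta-level model on out-of-fold predictions and report accuracy on held-out data.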

Authors and Affiliations

Muhammad Azam, Dr. Tanvir Ahmed, Dr. M. Usman Hashmi, Rehan Ahmad, Abdul Manan, Fahad Sabah

  • EP ID EP499588
  • DOI 10.14569/IJACSA.2019.0100360

How To Cite

Muhammad Azam, Dr. Tanvir Ahmed, Dr. M. Usman Hashmi, Rehan Ahmad, Abdul Manan, Fahad Sabah (2019). Improvement in Classification Algorithms through Model Stacking with the Consideration of their Correlation. International Journal of Advanced Computer Science & Applications, 10(3), 463-475. https://europub.co.uk/articles/-A-499588