Impacts of Unbalanced Test Data on the Evaluation of Classification Methods

Abstract

The performance of a classifier in a supervised machine learning problem is popularly evaluated by using the accuracy, precision, recall, and F1-score. These parameters could evaluate very well classifiers in the case that the number of positive label sample and the number of negative label sample in the testing set are balanced or nearly balanced. However, these parameters may miss-evaluate the classifiers in some case where the positive and negative samples in the testing set is unbalanced. This paper proposes some update in these parameters by taking into account the unbalanced factor which represents the unbal-ance ratio of positive and negative samples in the testing set. The new updated parameters are then experimentally evaluated to compare to the traditional parameters.

Authors and Affiliations

Manh Hung Nguyen

Keywords

Related Articles

A Model for Forecasting the Number of Cases and Distribution Pattern of Dengue Hemorrhagic Fever in Indonesia

Dengue Hemorrhagic Fever (DHF) ourbreaks is one of the lethal health problems in Indonesia. Aedes aegypti type of insect prolefiration as the main vector of DHF has affected climate factors, such as temperature, humidity...

Design and Simulation of a Novel Dual Band Microstrip Antenna for LTE-3 and LTE-7 Bands

Long Term Evolution (LTE) is currently being used in many developed countries and hopefully will be implemented in more countries. An antenna operating in LTE-3 band can support global roaming in ITU Regions 1 and 3, Cos...

Reverse Area Skyline in a Map

Skyline query retrieves a set of data objects, each of which is not dominated by another object. On the other hand, given a query object q, “reverse” skyline query retrieves a set of points that are “dynamic” skyline of...

Audio Watermarking with Error Correction 

In recent times, communication through the internet has tremendously facilitated the distribution of multimedia data. Although this is indubitably a boon, one of its repercussions is that it has also given impetus to the...

 Transforming Conceptual Model into Logical Model for Temporal Data Warehouse Security: A Case Study

 Extraction–transformation–loading (ETL) processes are responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. Data warehouse often store hist...

Download PDF file
  • EP ID EP499607
  • DOI 10.14569/IJACSA.2019.0100364
  • Views 79
  • Downloads 0

How To Cite

Manh Hung Nguyen (2019). Impacts of Unbalanced Test Data on the Evaluation of Classification Methods. International Journal of Advanced Computer Science & Applications, 10(3), 497-502. https://europub.co.uk/articles/-A-499607