Impacts of Unbalanced Test Data on the Evaluation of Classification Methods

Abstract

The performance of a classifier in a supervised machine learning problem is popularly evaluated by using the accuracy, precision, recall, and F1-score. These parameters could evaluate very well classifiers in the case that the number of positive label sample and the number of negative label sample in the testing set are balanced or nearly balanced. However, these parameters may miss-evaluate the classifiers in some case where the positive and negative samples in the testing set is unbalanced. This paper proposes some update in these parameters by taking into account the unbalanced factor which represents the unbal-ance ratio of positive and negative samples in the testing set. The new updated parameters are then experimentally evaluated to compare to the traditional parameters.

Authors and Affiliations

Manh Hung Nguyen

Keywords

Related Articles

Vision Based Geo Navigation Information Retrieval

In order to derive the three-dimensional camera position from the monocular camera vision, a geo-reference database is needed. Floor plan is a ubiquitous geo-reference database that every building refers to it during con...

Reliability and Connectivity Analysis of Vehicluar Ad Hoc Networks for a Highway Tunnel

Vehicular ad-hoc network (VANET) uses ‘mobile internet’ to facilitate the communication between vehicles and with the goal to ensure road safety and achieve secure communication. Thus the reliability of this type of netw...

A new optimization based image segmentation method by particle swarm optimization

 This paper proposes a new multilevel thresholding method segmenting images based on particle swarm optimization (PSO). In the proposed method, the thresholding problem is treated as an optimization problem, and sol...

Smart Building’s Elevator with Intelligent Control Algorithm based on Bayesian Networks

Implementation of the intelligent elevator control systems based on machine-learning algorithms should play an important role in our effort to improve the sustainability and convenience of multi-floor buildings. Traditio...

QoS Analysis to Optimize the Indoor Network IEEE 802.11 at UNTELS

This paper arose from the need to improve mobility and connectivity to network users of the Universidad Nacional Tecnológica de Lima Sur and the problems that arise on the quality of services (QoS) such as signal intermi...

Download PDF file
  • EP ID EP499607
  • DOI 10.14569/IJACSA.2019.0100364
  • Views 104
  • Downloads 0

How To Cite

Manh Hung Nguyen (2019). Impacts of Unbalanced Test Data on the Evaluation of Classification Methods. International Journal of Advanced Computer Science & Applications, 10(3), 497-502. https://europub.co.uk/articles/-A-499607