Using Game Theory to Handle Missing Data at Prediction Time of ID3 and C4.5 Algorithms

Abstract

The raw material of our paper is a well known and commonly used type of supervised algorithms: decision trees. Using a training data, they provide some useful rules to classify new data sets. But a data set with missing values is always the bane of a data scientist. Even though decision tree algorithms such as ID3 and C4.5 (the two algorithms with which we are working in this paper) represent some of the simplest pattern classification algorithms that can be applied in many domains, but with the drawback of missing data the task becomes harder because they may have to deal with unknown values in two major steps: at training step and at prediction step. This paper is involved in the processing step of databases using trees already constructed to classify the objects of these data sets. It comes with the idea to overcome the disturbance of missing values using the most famous and the central concept of the game theory approach which is the Nash equilibrium.

Authors and Affiliations

Halima Elaidi, Zahra Benabbou, Hassan Abbar

Keywords

Related Articles

Knowledge discovery from database using an integration of clustering and classification

Clustering and classification are two important techniques of data mining. Classification is a supervised learning problem of assigning an object to one of several pre-defined categories based upon the attributes of the...

NADA: New Arabic Dataset for Text Classification

In the recent years, Arabic Natural Language Processing, including Text summarization, Text simplification, Text Categorization and other Natural Language-related disciplines, are attracting more researchers. Appropriate...

Genetic algorithms to optimize base station sitting in WCDMA networks

In UMTS network, radio planning cannot only be based on signal predictions, but it must also consider the traffic distribution, the power control mechanism as well as the power limits and the signal quality constraints....

Modeling the Cut-off Frequency of Acoustic Signal with an Adaptative Neuro-Fuzzy Inference System (ANFIS)

An Adaptative Neuro-Fuzzy Inference System (ANFIS), new flexible tool, is applied to predict the cut-off frequencies of the symmetric and the anti-symmetric circumferential waves (Si and Ai, i=1,2) propagating around an...

A Machine Learning Approach for Predicting Nicotine Dependence

An examination of the ability of machine learning methodologies in classifying women Waterpipe (WP) smoker’s level of nicotine dependence is proposed in this work. In this study, we developed a classifier that predicts t...

Download PDF file
  • EP ID EP429165
  • DOI 10.14569/IJACSA.2018.091232
  • Views 106
  • Downloads 0

How To Cite

Halima Elaidi, Zahra Benabbou, Hassan Abbar (2018). Using Game Theory to Handle Missing Data at Prediction Time of ID3 and C4.5 Algorithms. International Journal of Advanced Computer Science & Applications, 9(12), 218-224. https://europub.co.uk/articles/-A-429165