Using Game Theory to Handle Missing Data at Prediction Time of ID3 and C4.5 Algorithms
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2018, Vol 9, Issue 12
Abstract
The subject of this paper is a well-known and commonly used family of supervised learning algorithms: decision trees. Given training data, they produce rules that can be used to classify new data sets. A data set with missing values, however, is a persistent problem for data scientists. Decision tree algorithms such as ID3 and C4.5 (the two algorithms studied in this paper) are among the simplest pattern classification algorithms and can be applied in many domains, but missing data makes the task harder because unknown values may have to be handled at two major steps: the training step and the prediction step. This paper addresses the prediction step, in which trees that have already been constructed are used to classify the objects of a data set. It proposes to overcome the disturbance caused by missing values using the central and best-known concept of game theory, the Nash equilibrium.
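For readers unfamiliar with the Nash equilibrium named above, the following minimal sketch enumerates the pure-strategy equilibria of a two-player game given as payoff matrices. It illustrates the general concept only and is not the authors' method for handling missing values: the abstract does not say how the game is constructed. The function name `pure_nash_equilibria` and the example payoffs (a standard prisoner's dilemma) are assumptions chosen for illustration.

```python
from itertools import product


def pure_nash_equilibria(payoff_a, payoff_b):
    """Return all (row, col) cells where neither player gains by deviating alone.

    payoff_a[r][c] is the row player's payoff and payoff_b[r][c] the column
    player's payoff when the row player picks r and the column player picks c.
    """
    n_rows, n_cols = len(payoff_a), len(payoff_a[0])
    equilibria = []
    for r, c in product(range(n_rows), range(n_cols)):
        # Row player cannot do better by switching rows while the column stays c.
        row_best = all(payoff_a[r][c] >= payoff_a[r2][c] for r2 in range(n_rows))
        # Column player cannot do better by switching columns while the row stays r.
        col_best = all(payoff_b[r][c] >= payoff_b[r][c2] for c2 in range(n_cols))
        if row_best and col_best:
            equilibria.append((r, c))
    return equilibria


# Hypothetical payoffs (prisoner's dilemma): strategy 0 = cooperate, 1 = defect.
payoff_a = [[-1, -3], [0, -2]]
payoff_b = [[-1, 0], [-3, -2]]
print(pure_nash_equilibria(payoff_a, payoff_b))  # [(1, 1)]
```

A game may have several pure-strategy equilibria or none at all, so any scheme built on this idea would also need a rule for those cases.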
Authors and Affiliations
Halima Elaidi, Zahra Benabbou, Hassan Abbar