A Comparison of Five Methods for Missing Value Imputation in Data Sets
Journal Title: International Scientific and Vocational Studies Journal - Year 2018, Vol 2, Issue 2
Abstract
Missing values in data sets prevent accurate analysis. The correct imputation of missing values has therefore become a focus of attention for researchers in recent years. This paper compares the most reliable and up-to-date estimation methods for imputing missing values. Imputation of missing values has a very high priority because of its impact on subsequent pre-processing, data analysis, classification, clustering, etc. Root mean square error (RMSE), classification accuracy, and execution time are used to evaluate the performance of the five most popular methods (mean, k-nearest neighbors, singular value decomposition, Bayesian principal component analysis, and missForest). When the RMSE and classification accuracy values of the methods were compared, it was observed that the missForest method outperformed the other methods on all datasets.
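As an illustration of the RMSE criterion mentioned in the abstract, the following is a minimal sketch of the simplest of the five methods (mean imputation) scored against known ground truth. It is not the paper's code: the function names `mean_impute` and `rmse_on_missing`, the toy data, and the choice to compute RMSE only over the artificially hidden cells are all assumptions made for this example.

```python
import math

def mean_impute(rows):
    """Replace None entries with the mean of the observed values in the same column.

    Assumes every column has at least one observed value.
    """
    n_cols = len(rows[0])
    means = []
    for j in range(n_cols):
        observed = [r[j] for r in rows if r[j] is not None]
        means.append(sum(observed) / len(observed))
    return [[means[j] if r[j] is None else r[j] for j in range(n_cols)]
            for r in rows]

def rmse_on_missing(truth, masked, imputed):
    """RMSE over only the cells that were hidden (None) in `masked`."""
    squared_errors = [
        (truth[i][j] - imputed[i][j]) ** 2
        for i in range(len(truth))
        for j in range(len(truth[0]))
        if masked[i][j] is None
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

# Hypothetical toy data: a complete matrix and a copy with two cells hidden.
truth = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0], [4.0, 40.0]]
masked = [[1.0, 10.0], [None, 20.0], [3.0, None], [4.0, 40.0]]

imputed = mean_impute(masked)
score = rmse_on_missing(truth, masked, imputed)
print(f"RMSE of mean imputation on hidden cells: {score:.4f}")
```

A lower RMSE means the imputed values lie closer to the hidden true values. Under the same assumptions, any of the other methods could be substituted for `mean_impute` and scored with the same function.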
Authors and Affiliations
Pınar Cihan