Imputation And Classification Of Missing Data Using Least Square Support Vector Machines – A New Approach In Dementia Diagnosis

Abstract

This paper presents a comparison of different data imputation approaches used in filling missing data and proposes a combined approach to estimate accurately missing attribute values in a patient database. The present study suggests a more robust technique that is likely to supply a value closer to the one that is missing for effective classification and diagnosis. Initially data is clustered and z-score method is used to select possible values of an instance with missing attribute values. Then multiple imputation method using LSSVM (Least Squares Support Vector Machine) is applied to select the most appropriate values for the missing attributes. Five imputed datasets have been used to demonstrate the performance of the proposed method. Experimental results show that our method outperforms conventional methods of multiple imputation and mean substitution. Moreover, the proposed method CZLSSVM (Clustered Z-score Least Square Support Vector Machine) has been evaluated in two classification problems for incomplete data. The efficacy of the imputation methods have been evaluated using LSSVM classifier. Experimental results indicate that accuracy of the classification is increases with CZLSSVM in the case of missing attribute value estimation. It is found that CZLSSVM outperforms other data imputation approaches like decision tree, rough sets and artificial neural networks, K-NN (K-Nearest Neighbour) and SVM. Further it is observed that CZLSSVM yields 95 per cent accuracy and prediction capability than other methods included and tested in the study.

Authors and Affiliations

T Sivapriya, A. R. Banu Kamal, V. Thavavel

Keywords

Related Articles

Algorithm for Design of Digital Notch Filter Using Simulation

A smooth waveform is generated of low frequency signal can be achieved through the Digital Notch Filter. Noise can be easily eliminated from a speech signal by using a Notch filter. In this paper the design of notch filt...

 Pursuit Reinforcement Competitive Learning: PRCL based Online Clustering with Learning Automata

 A new online clustering method based on not only reinforcement and competitive learning but also pursuit algorithm (Pursuit Reinforcement Competitive Learning: PRCL) as well as learning automata is proposed for rea...

3D Skeleton model derived from Kinect Depth Sensor Camera and its application to walking style quality evaluations

Feature extraction for gait recognition has been created widely. The ancestor for this task is divided into two parts, model based and free-model based. Model-based approaches obtain a set of static or dynamic skeleton p...

 An Information-Theoretic Measure for Face Recognition: Comparison with Structural Similarity

 Automatic recognition of people faces is a challenging problem that has received significant attention from signal processing researchers in recent years. This is due to its several applications in different fields...

 Evolving Software Effort Estimation Models Using Multigene Symbolic Regression Genetic Programming

Software has played an essential role in engineering, economic development, stock market growth and military applications. Mature software industry count on highly predictive software effort estimation models. Correct es...

Download PDF file
  • EP ID EP150921
  • DOI -
  • Views 131
  • Downloads 0

How To Cite

T Sivapriya, A. R. Banu Kamal, V. Thavavel (2012). Imputation And Classification Of Missing Data Using Least Square Support Vector Machines – A New Approach In Dementia Diagnosis. International Journal of Advanced Research in Artificial Intelligence(IJARAI), 1(4), 29-34. https://europub.co.uk/articles/-A-150921