Imputation And Classification Of Missing Data Using Least Square Support Vector Machines – A New Approach In Dementia Diagnosis

Abstract

This paper compares different data imputation approaches for filling in missing data and proposes a combined approach to accurately estimate missing attribute values in a patient database. The study suggests a more robust technique that is likely to supply a value closer to the one that is missing, for effective classification and diagnosis. First, the data is clustered and the z-score method is used to select possible values for an instance with missing attribute values. Then a multiple imputation method using LSSVM (Least Squares Support Vector Machine) is applied to select the most appropriate values for the missing attributes. Five imputed datasets have been used to demonstrate the performance of the proposed method. Experimental results show that the method outperforms conventional multiple imputation and mean substitution. Moreover, the proposed method, CZLSSVM (Clustered Z-score Least Square Support Vector Machine), has been evaluated on two classification problems for incomplete data. The efficacy of the imputation methods has been evaluated using an LSSVM classifier. Experimental results indicate that classification accuracy increases when CZLSSVM is used to estimate missing attribute values. CZLSSVM is found to outperform other data imputation approaches such as decision trees, rough sets, artificial neural networks, K-NN (K-Nearest Neighbour) and SVM. Further, it is observed that CZLSSVM achieves 95 per cent accuracy and better prediction capability than the other methods tested in the study.
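The three stages named in the abstract (clustering, z-score selection, LSSVM estimation) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the clustering algorithm, the z-score threshold, the kernel, or how the five imputed datasets are generated. The sketch assumes k-means clustering, a threshold `z_max = 2.0`, an RBF kernel with hypothetical parameters `gamma` and `sigma`, and a single imputation of one numeric column. All function names are illustrative.

```python
import numpy as np

def sq_dists(A, B):
    """Pairwise squared Euclidean distances between rows of A and rows of B."""
    return ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2)

def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means (an assumed choice); returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = sq_dists(X, centers).argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return sq_dists(X, centers).argmin(axis=1), centers

def rbf(A, B, sigma):
    """RBF kernel matrix between rows of A and rows of B."""
    return np.exp(-sq_dists(A, B) / (2.0 * sigma ** 2))

def lssvm_fit(X, y, gamma=10.0, sigma=1.0):
    """LSSVM regression: solve [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y]."""
    n = len(X)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    A[1:, 1:] = rbf(X, X, sigma) + np.eye(n) / gamma
    sol = np.linalg.solve(A, np.concatenate(([0.0], y)))
    return sol[0], sol[1:]  # bias b, dual weights alpha

def lssvm_predict(X_train, b, alpha, X_new, sigma=1.0):
    """Evaluate f(x) = sum_i alpha_i K(x, x_i) + b for each row of X_new."""
    return rbf(X_new, X_train, sigma) @ alpha + b

def impute_column(X, col, k=2, z_max=2.0):
    """Fill NaNs in one column: cluster, filter donors by z-score, predict with LSSVM."""
    X = X.astype(float).copy()
    miss = np.isnan(X[:, col])
    other = [j for j in range(X.shape[1]) if j != col]
    obs = X[~miss]  # rows where the target column is observed
    labels, centers = kmeans(obs[:, other], k)
    for i in np.where(miss)[0]:
        # Assign the incomplete row to its nearest cluster on the other columns.
        c = sq_dists(X[i:i + 1][:, other], centers).argmin()
        donors = obs[labels == c]
        # Keep donors whose target value is not an outlier within the cluster.
        t = donors[:, col]
        z = (t - t.mean()) / (t.std() + 1e-12)
        donors = donors[np.abs(z) <= z_max]
        if len(donors) < 2:  # fallback (assumption): use all observed rows
            donors = obs
        b, alpha = lssvm_fit(donors[:, other], donors[:, col])
        X[i, col] = lssvm_predict(donors[:, other], b, alpha,
                                  X[i:i + 1][:, other])[0]
    return X
```

Calling `impute_column(data, col=2)` on a NumPy array whose third column contains `NaN` entries returns a copy with those entries filled. The sketch assumes every other column is fully observed. Repeating the call with different `seed` or `z_max` values is one possible way to obtain several imputed datasets, though the abstract does not say how the authors produced theirs.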

Authors and Affiliations

T Sivapriya, A. R. Banu Kamal, V. Thavavel

How To Cite

T Sivapriya, A. R. Banu Kamal, V. Thavavel (2012). Imputation And Classification Of Missing Data Using Least Square Support Vector Machines – A New Approach In Dementia Diagnosis. International Journal of Advanced Research in Artificial Intelligence (IJARAI), 1(4), 29-34. https://europub.co.uk/articles/-A-150921