A Rank Aggregation Algorithm for Ensemble of Multiple Feature Selection Techniques in Credit Risk Evaluation

Abstract

 In credit risk evaluation the accuracy of a classifier is very significant for classifying the high-risk loan applicants correctly. Feature selection is one way of improving the accuracy of a classifier. It provides the classifier with important and relevant features for model development. This study uses the ensemble of multiple feature ranking techniques for feature selection of credit data. It uses five individual rank based feature selection methods. It proposes a novel rank aggregation algorithm for combining the ranks of the individual feature selection methods of the ensemble. This algorithm uses the rank order along with the rank score of the features in the ranked list of each feature selection method for rank aggregation. The ensemble of multiple feature selection techniques uses the novel rank aggregation algorithm and selects the relevant features using the 80%, 60%, 40% and 20% thresholds from the top of the aggregated ranked list for building the C4.5, MLP, C4.5 based Bagging and MLP based Bagging models. It was observed that the performance of models using the ensemble of multiple feature selection techniques is better than the performance of 5 individual rank based feature selection methods. The average performance of all the models was observed as best for the ensemble of feature selection techniques at 60% threshold. Also, the bagging based models outperformed the individual models most significantly for the 60% threshold. This increase in performance is more significant from the fact that the number of features were reduced by 40% for building the highest performing models. This reduces the data dimensions and hence the overall data size phenomenally for model building. The use of the ensemble of feature selection techniques using the novel aggregation algorithm provided more accurate models which are simpler, faster and easy to interpret.

Authors and Affiliations

Shashi Dahiya, S. S Handa, N. P Singh

Keywords

Related Articles

Optimisation of Resource Scheduling in VCIM Systems Using Genetic Algorithm

The concept of Virtual Computer-Integrated Manufacturing (VCIM) has been proposed for one and a half decade with purpose of overcoming the limitation of traditional Computer-Integrated Manufacturing (CIM) as it only work...

 Thresholding Based Method for Rainy Cloud Detection with NOAA/AVHRR Data by Means of Jacobi Itteration Method

 Thresholding based method for rainy cloud detection with NOAA/AVHRR data by means of Jacobi iteration method is proposed. Attempts of the proposed method are made through comparisons to truth data which are provide...

 Mobile Device Based Personalized Equalizer for Improving Hearing Capability of Human Voices in Particular for Elderly Persons

 Mobile device based personalized equalizer for improving the hearing capability of human voices in particular for elderly persons are proposed. Through experiments, it is found that the proposed equalizer does work...

Bi-Directional Reflectance Distribution Function: BRDF Effect on Un-mixing, Category Decomposition of the Mixed Pixel (MIXEL) of Remote Sensing Satellite Imagery Data

Method for unmixing, category decomposition of the mixed pixel (MIXEL) of remote sensing satellite imagery data taking into account the effect due to Bi-Directional Reflectance Distribution Function: BRDF is proposed. Al...

 Recognition of Similar Wooden Surfaces with a Hierarchical Neural Network Structure

 The surface quality assurance check is an important task in industrial production of wooden parts. There are many automated systems applying different methods for preprocessing and recognition/classification of sur...

Download PDF file
  • EP ID EP123358
  • DOI 10.14569/IJARAI.2016.050901
  • Views 122
  • Downloads 0

How To Cite

Shashi Dahiya, S. S Handa, N. P Singh (2016).  A Rank Aggregation Algorithm for Ensemble of Multiple Feature Selection Techniques in Credit Risk Evaluation. International Journal of Advanced Research in Artificial Intelligence(IJARAI), 5(9), 1-8. https://europub.co.uk/articles/-A-123358