A Rank Aggregation Algorithm for Ensemble of Multiple Feature Selection Techniques in Credit Risk Evaluation

Abstract

 In credit risk evaluation the accuracy of a classifier is very significant for classifying the high-risk loan applicants correctly. Feature selection is one way of improving the accuracy of a classifier. It provides the classifier with important and relevant features for model development. This study uses the ensemble of multiple feature ranking techniques for feature selection of credit data. It uses five individual rank based feature selection methods. It proposes a novel rank aggregation algorithm for combining the ranks of the individual feature selection methods of the ensemble. This algorithm uses the rank order along with the rank score of the features in the ranked list of each feature selection method for rank aggregation. The ensemble of multiple feature selection techniques uses the novel rank aggregation algorithm and selects the relevant features using the 80%, 60%, 40% and 20% thresholds from the top of the aggregated ranked list for building the C4.5, MLP, C4.5 based Bagging and MLP based Bagging models. It was observed that the performance of models using the ensemble of multiple feature selection techniques is better than the performance of 5 individual rank based feature selection methods. The average performance of all the models was observed as best for the ensemble of feature selection techniques at 60% threshold. Also, the bagging based models outperformed the individual models most significantly for the 60% threshold. This increase in performance is more significant from the fact that the number of features were reduced by 40% for building the highest performing models. This reduces the data dimensions and hence the overall data size phenomenally for model building. The use of the ensemble of feature selection techniques using the novel aggregation algorithm provided more accurate models which are simpler, faster and easy to interpret.

Authors and Affiliations

Shashi Dahiya, S. S Handa, N. P Singh

Keywords

Related Articles

Evacuation Path Selection for Firefighters Based on Dynamic Triangular Network Model

Path selection is one of the critical aspects in emergency evacuation. In a fire scene, how to choose an optimal evacuation path for firefighters is a challenging aspect. In this paper, firstly, a dynamic triangular netw...

 A Fuzzy Approach to Classify Learning Disability

 The endeavor of this work is to support the special education community in their quest to be with the mainstream. The initial segment of the paper gives an exhaustive study of the different mechanisms of diagnosing...

Texture Based Image Retrieval Using Framelet Transform–Gray Level Co-occurrence Matrix(GLCM)

This paper presents a novel content based image retrieval (CBIR) system based on Framelet Transform combined with gray level co-occurrence matrix (GLCM).The proposed method is shift invariant which captured edge informat...

Spatial Metrics based Landscape Structure and Dynamics Assessment for an emerging Indian Megalopolis

Human-induced land use changes are considered the prime agents of the global environmental changes. Urbanisation and associated growth patterns (urban sprawl) are characteristic of spatial temporal changes that take plac...

Comparative study between the proposed shape independent clustering method and the conventional methods (K-means and the other)

 Cluster analysis aims at identifying groups of similar objects and, therefore helps to discover distribution of patterns and interesting correlations in the data sets. In this paper, we propose to provide a consist...

Download PDF file
  • EP ID EP123358
  • DOI 10.14569/IJARAI.2016.050901
  • Views 135
  • Downloads 0

How To Cite

Shashi Dahiya, S. S Handa, N. P Singh (2016).  A Rank Aggregation Algorithm for Ensemble of Multiple Feature Selection Techniques in Credit Risk Evaluation. International Journal of Advanced Research in Artificial Intelligence(IJARAI), 5(9), 1-8. https://europub.co.uk/articles/-A-123358