Improving Credit Scorecard Modeling Through Applying Text Analysis

Abstract

In the credit card scoring and loans management, the prediction of the applicant’s future behavior is an important decision support tool and a key factor in reducing the risk of Loan Default. A lot of data mining and classification approaches have been developed for the credit scoring purpose. For the best of our knowledge, building a credit scorecard by analyzing the textual data in the application form has not been explored so far. This paper proposes a comprehensive credit scorecard model technique that improves credit scorecard modeling though employing textual data analysis. This study uses a sample of loan application forms of a financial institution providing loan services in Yemen, which represents a real-world situation of the credit scoring and loan management. The sample contains a set of Arabic textual data attributes defining the applicants. The credit scoring model based on the text mining pre-processing and logistic regression techniques is proposed and evaluated through a comparison with a group of credit scorecard modeling techniques that use only the numeric attributes in the application form. The results show that adding the textual attributes analysis achieves higher classification effectiveness and outperforms the other traditional numerical data analysis techniques.

Authors and Affiliations

Omar Ghailan, Hoda Mokhtar, Osman Hegazy

Keywords

Related Articles

Hyperparameter Optimization in Convolutional Neural Network using Genetic Algorithms

Optimizing hyperparameters in Convolutional Neural Network (CNN) is a tedious problem for many researchers and practitioners. To get hyperparameters with better performance, experts are required to configure a set of hyp...

Agent-Based Co-Modeling of Information Society and Wealth Distribution

With empirical studies suggesting that information technology influence wealth distribution in different ways, and with economic interactions and information technology adoption being two complex phenomena, there is a ne...

A QUADRATIC CONVERGENCE METHOD FOR THE MANAGEMENT EQUILIBRIUM MODEL

In this paper, we study a class of methods for solving the management equilibrium model. We first give an estimate of the error bound for the model, and then, based on the estimate of the error bound, propose a method fo...

AL-S[sup]2[/sup]m: Soft road traffic Signs map for vehicular systems

In this paper, we describe AL-S[sup]2[/sup]m, a roadmap with traffic signs to be used in vehicular systems. AL-S[sup]2[/sup]m is part of a more general system of traffic signs (TSs) management, called AL-S[sup]2[/sup], w...

A Novel Method to Design S-Boxes Based on Key-Dependent Permutation Schemes and its Quality Analysis

S-boxes are used in block ciphers as the important nonlinear components. The nonlinearity provides important protection against linear and differential cryptanalysis. The S-boxes used in encryption process could be chose...

Download PDF file
  • EP ID EP123097
  • DOI 10.14569/IJACSA.2016.070467
  • Views 84
  • Downloads 0

How To Cite

Omar Ghailan, Hoda Mokhtar, Osman Hegazy (2016). Improving Credit Scorecard Modeling Through Applying Text Analysis. International Journal of Advanced Computer Science & Applications, 7(4), 512-517. https://europub.co.uk/articles/-A-123097