Extracting Credit Rules from Imbalanced Data: The Case of an Iranian Export Development Bank

Journal Title: Journal of Information Systems and Telecommunication - Year 2015, Vol 3, Issue 1

Abstract

Credit scoring is an important topic, and banks collect different data from their loan applicant to make an appropriate and correct decision. Rule bases are of more attention in credit decision making because of their ability to explicitly distinguish between good and bad applicants. The credit scoring datasets are usually imbalanced. This is mainly because the number of good applicants in a portfolio of loan is usually much higher than the number of loans that default. This paper use previous applied rule bases in credit scoring, including RIPPER, OneR, Decision table, PART and C4.5 to study the reliability and results of sampling on its own dataset. A real database of one of an Iranian export development bank is used and, imbalanced data issues are investigated by randomly Oversampling the minority class of defaulters, and three times under sampling of majority of non-defaulters class. The performance criterion chosen to measure the reliability of rule extractors is the area under the receiver operating characteristic curve (AUC), accuracy and number of rules. Friedman’s statistic is used to test for significance differences between techniques and datasets. The results from study show that PART is better and good and bad samples of data affect its results less.

Authors and Affiliations

Seyed Mahdi Sadatrasoul, Mohammad Reza Gholamian, Kamran Shahanaghi

Keywords

Related Articles

Statistical Analysis of Different Traffic Types Effect on QoS of Wireless Ad Hoc Networks

IEEE 802.11 based wireless ad hoc networks are highly appealing owing to their needless of infrastructures, ease and quick deployment and high availability. Vast variety of applications such as voice and video transmissi...

A new Sparse Coding Approach for Human Face and Action Recognition

Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image, video and etc. In the cases where we have some similar images from the different classes, using the spars...

Unsupervised Segmentation of Retinal Blood Vessels Using the Human Visual System Line Detection Model

Retinal image assessment has been employed by the medical community for diagnosing vascular and non-vascular pathology. Computer based analysis of blood vessels in retinal images will help ophthalmologists monitor larger...

An Effective Risk Computation Metric for Android Malware Detection

Android has been targeted by malware developers since it has emerged as widest used operating system for smartphones and mobile devices. Android security mainly relies on user decisions regarding to installing applicatio...

A New Finite Field Multiplication Algorithm to Improve Elliptic Curve Cryptosystem Implementations

This paper presents a new and efficient implementation approach for the elliptic curve cryptosystem (ECC) based on a novel finite field multiplication in GF(2m) and an efficient scalar multiplication algorithm. This new...

Download PDF file
  • EP ID EP184747
  • DOI 10.7508/jist.2015.01.004
  • Views 107
  • Downloads 0

How To Cite

Seyed Mahdi Sadatrasoul, Mohammad Reza Gholamian, Kamran Shahanaghi (2015). Extracting Credit Rules from Imbalanced Data: The Case of an Iranian Export Development Bank. Journal of Information Systems and Telecommunication, 3(1), 22-28. https://europub.co.uk/articles/-A-184747