Clustering Analysis for Credit Default Probabilities in a Retail Bank Portfolio
Journal Title: Database Systems Journal - Year 2012, Vol 3, Issue 2
Abstract
Methods underlying cluster analysis are very useful in data analysis, especially when the processed volume of data is very large, so that it becomes impossible to extract essential information, unless specific instruments are used to summarize and structure the gross information. In this context, cluster analysis techniques are used particularly, for systematic information analysis. The aim of this article is to build an useful model for banking field, based on data mining techniques, by dividing the groups of borrowers into clusters, in order to obtain a profile of the customers (debtors and good payers). We assume that a class is appropriate if it contains members that have a high degree of similarity and the standard method for measuring the similarity within a group shows the lowest variance. After clustering, data mining techniques are implemented on the cluster with bad debtors, reaching a very high accuracy after implementation. The paper is structured as follows: Section 2 describes the model for data analysis based on a specific scoring model that we proposed. In section 3, we present a cluster analysis using K-means algorithm and the DM models are applied on a specific cluster. Section 4 shows the conclusions.
Authors and Affiliations
Adela Tudor, Elena Andrei Dragomir
Column-Oriented Databases, an Alternative for Analytical Environment
It is widely accepted that a data warehouse is the central place of a Business Intelligence system. It stores all data that is relevant for the company, data that is acquired both from internal and external sources. Such...
Analysis on Cloud Computing Database in Cloud Environment – Concept and Adoption Paradigm
With the development of the Internet’s new technical functionalities, new concepts have started to take shape. These concepts have an important role especially in the development of corporate IT. Such a concept is „the C...
PL/SQL and Bind Variable: the two ways to increase the efficiency of Network Databases
Modern data analysis applications are driven by the Network databases. They are pushing traditional database and data warehousing technologies beyond their limits due to their massively increasing data volumes and...
Perspectives on the Role of Business Rules in Database Design
Business rules are at the foundation of every information system as they drive and offer guidelines for managing and conducting all activities within an organizations. They are important both for operational system...
Applications of Spatial Data Using Business Analytics Tools
This paper addresses the possibilities of using spatial data in business analytics tools, with emphasis on SAS software. Various kinds of map data sets containing spatial data are presented and discussed. Examples...