Comparison of machine learning algorithms on consumer credit classification

Download

index.pdf

Date

2019

Author

Koç, Oğuz

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

263
views

203
downloads

Like other prediction models, credit scoring is a tool used to evaluate the amount of risk associated with applicants or customers. Scoring models identify clients individually as good or bad applicants. They offer statistical odds or probabilities for prediction either the applicant will be default or not in the future. It is beneficial for banks and credit analysts to measure customers' non-payment risk by statistically tested algorithms in many aspects such as reduction in workload and evaluation time. Also, only demanding features that have the most significant impact on credit assessment process in terms of obtaining more explanatory outcomes, emphasizes the benefits mentioned formerly. Today, Machine Learning (ML) algorithms are commonly applied for data analysis in various areas. The algorithms learn how to determine complicated patterns and create smart choices by generating a mathematical model depending on sample dataset without direct programming. In this thesis, a comparative study is performed using Logistic Regression (LR), Support Vector Machine (SVM), Gaussian Naïve Bayes (GNB), Decision Trees (DT), Random Forest (DT), XGBoost (XGB), K-Nearest Neighbors (KNN) and Multilayer Perceptron Neural Network (MLP) algorithms. In addition to these, we strive to achieve more explanatory outcomes in terms of dimentionality with Wrapper Feature Selection (WFS), and investigate its performance in a way of important attributes detection capacity. We also analyze the impact of Grid Search (GS) hyper-parameters optimizing method, and effect of four data transformation techniques Natural Logarithm (LN), Standard, Box-Cox and Min-Max to these algorithms and methods. We compare these cases to determine the most appropriate way for credit classification by considering accuracy, AUC, type I and type II error rates. All measurements are conducted on German and Australian real world consumer credit datasets commonly used in literature.

Subject Keywords

Consumer credit, Consumer credit Information technology., Machine Learning, Credit Classification, German Credit Dataset, Australian Credit Dataset, Data Transformation, Wrapper Feature Selection, Data Mining

URI

http://etd.lib.metu.edu.tr/upload/12623539/index.pdf
https://hdl.handle.net/11511/43610

Collections

Graduate School of Applied Mathematics, Thesis

Suggestions

OpenMETU
Core

Credit scoring methods and accuray ratio İşcanoğlu, Ayşegül; Körezlioğlu, Hayri; Department of Financial Mathematics (2005) The credit scoring with the help of classification techniques provides to take easy and quick decisions in lending. However, no definite consensus has been reached with regard to the best method for credit scoring and in what conditions the methods performs best. Although a huge range of classification techniques has been used in this area, the logistic regression has been seen an important tool and used very widely in studies. This study aims to examine accuracy and bias properties in parameter estimation ...
An Empirical Investigation Of Payment Performance For Consumer Loans In Turkey Özdemir, Özlem (Orta Doğu Teknik Üniversitesi (Ankara, Turkey), 2008-12) This paper explores the relationship between consumer credit clients’ payment performance and some demographic and financial variables. Data to examine this relationship is obtained from the customer records of a private bank in Turkey. A logistic binary regression is used to evaluate the data. Financial variables rather than the demographic characteristics of clients have significant influence on customers’ pay back performance. Thus, the longer the maturity time and the higher the interest rate, the highe...
Application of a rapid methodology for preliminary appraisal of kaolinite deposits Cetin, Mahir Can; Altun, Naci Emre (EDP Sciences; 2016-09-28) An approach that facilitates the mineralogical-compositional analysis and beneficiation-classification procedure was used for fast assessment of the evaluation possibility of kaolin deposits. The approach was applied on two different kaolin deposits from the Aegean region in Turkey. The kaolin samples were characterized using XRD and XRF analyses to determine the key mineralogical characteristics and major components such as Al2O3. The samples were then subjected to the attrition-scrubbing-hydrocycloning pr...
Credit Risk Market and the Recent Loan Profile in the Turkish Banking Sector Özdemir, Özlem (2009-05-01) Very little research has been done on the financial stability implications of credit risk transfer markets. In particular there is a paucity of work considering the interactions between the various credit risk transfer markets or instruments. Regarding credit derivatives, the small number of existing studies can be explained by a lack of quantitative data and by the brief history of the market" (Kiff et al., 2002, page 2). This paper tries to explain the development of credit risk transfer instruments and h...
A classification problem of credit risk rating investigated and solved by optimisation of the ROC curve Kurum, Efsun; Yildirak, Kasirga; Weber, Gerhard Wilhelm (2012-09-01) Estimation of probability of default has considerable importance in risk management applications where default risk is referred to as credit risk. Basel II (Committee on Banking Supervision) proposes a revision to the international capital accord that implies a more prominent role for internal credit risk assessments based on the determination of default probability of borrowers. In our study, we classify borrower firms into rating classes with respect to their default probability. The classification of fir...

Citation Formats

O. Koç, “Comparison of machine learning algorithms on consumer credit classification,” Thesis (M.S.) -- Graduate School of Applied Mathematics. Financial Mathematics., Middle East Technical University, 2019.