Credit Risk Modeling: Combining Classification And Regression Algorithms to Predict Expected Loss

  • Tim Kreienkamp
  • Andrey Kateshov
Keywords: Basel II, credit risk, LGD, kaggle, gradient boosting, feature selection

Abstract

Credit risk assessment is of paramount importance in the financial industry. Machine learning techniques have been used successfully over the last two decades to predict the probability of loan default (PD). This way, credit decisions can be automated and risk can be reduced significantly. In the more recent parts, intensified regulatory requirements led to the need to include another parameter – loss given default (LGD), the share of the loan which cannot be recovered in case of loan default – in risk models. We aim to build a unified credit risk model by estimating both parameters jointly to estimate expected loss. A large, highdimensional, real world dataset is used to benchmark several combinations of classification, regression and feature selection algorithms. The results indicate that non-linear techniques work especially well to model expected loss.

Downloads

Download data is not yet available.
Published
2014-12-09
How to Cite
KreienkampT. and KateshovA. (2014) “Credit Risk Modeling: Combining Classification And Regression Algorithms to Predict Expected Loss”, Journal of Corporate Finance Research | ISSN: 2073-0438, 8(4), pp. 4-10. doi: 10.17323/j.jcfr.2073-0438.8.4.2014.4-10.
Section
New Research