A comparative analysis of machine learning algorithms for predicting probabilities of default
By: Adrian Iulian Cristescu, Matteo Giordano
Potential Business Impact:
Helps banks guess if people will repay loans.
Predicting the probability of default (PD) of prospective loans is a critical objective for financial institutions. In recent years, machine learning (ML) algorithms have achieved remarkable success across a wide variety of prediction tasks; yet, they remain relatively underutilised in credit risk analysis. This paper highlights the opportunities that ML algorithms offer to this field by comparing the performance of five predictive models-Random Forests, Decision Trees, XGBoost, Gradient Boosting and AdaBoost-to the predominantly used logistic regression, over a benchmark dataset from Scheule et al. (Credit Risk Analytics: The R Companion). Our findings underscore the strengths and weaknesses of each method, providing valuable insights into the most effective ML algorithms for PD prediction in the context of loan portfolios.
Similar Papers
Enhancing Bankruptcy Prediction of Banks through Advanced Machine Learning Techniques: An Innovative Approach and Analysis
Machine Learning (CS)
Predicts bank failures with 90% accuracy.
A Comparative Study of Machine Learning Techniques for Early Prediction of Diabetes
Machine Learning (CS)
Helps doctors find diabetes early using computers.
Enhancing ML Models Interpretability for Credit Scoring
Machine Learning (CS)
Helps banks predict loan risk with clear rules.