The application of brute force logistic regression to corporate credit scoring models: Evidence from Serbian financial statements
Published in: | Expert systems with applications 2013-11, Vol.40 (15), p.5932-5944 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | • A model for real-life credit scoring applications has been developed using brute force logistic regression. • A thorough investigation of the most predictive financial ratios has been performed. • A clustering technique for reducing the number of highly correlated variables has been applied. • A model with the highest predictive power has been proposed.
In this paper, a brute force logistic regression (LR) modeling approach is proposed and used to develop a predictive credit scoring model for corporate entities. The modeling is based on 5 years of data from end-of-year financial statements of Serbian corporate entities, as well as default event data. To the best of our knowledge, no relevant research on the predictive power of financial ratios derived from Serbian financial statements has been published so far. This is also the first paper to generate 350 financial ratios as independent variables for predicting defaults of 7590 corporate entities. Many of the derived financial ratios are new and have not been discussed in the literature before. The weight of evidence (WOE) method has been applied to transform and prepare the financial ratios for brute force LR fitting simulations. A clustering method has been used to reduce the long list of variables and to remove highly correlated financial ratios from the partitioned training and validation datasets. The clustering results revealed that the number of variables can be reduced to a short list of 24 financial ratios, which are then analyzed in terms of their power to predict default events. In this paper we propose the most predictive financial ratios from the financial statements of Serbian corporate entities. The resulting short list of financial ratios has been used as the main input for the brute force LR model simulations. According to the literature, the common practice for selecting variables for a final model is to run stepwise, forward, or backward LR. In this research, however, the brute force LR simulations fit every possible model comprising 5–14 independent variables from the short list of 24 financial ratios. The total number of simulated LR models is around 14 million. Each model has been fitted through extensive and time-consuming brute force LR simulations using SAS® code written by the authors.
A total of 342,016 simulated models (“well-founded” models) satisfied the established credit scoring model validity conditions. The well-founded models have been ranked according to GI |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2013.05.022 |
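
The "around 14 million" figure in the summary can be checked by counting the variable subsets of size 5 to 14 drawn from 24 ratios. The sketch below is illustrative only and is not the authors' SAS® code: the function names `count_models` and `enumerate_subsets` and the placeholder ratio names are assumptions introduced here. It counts the candidate models and shows how such subsets could be enumerated before fitting.

```python
from itertools import combinations
from math import comb


def count_models(n_vars: int, k_min: int, k_max: int) -> int:
    """Count distinct variable subsets whose size lies in k_min..k_max."""
    return sum(comb(n_vars, k) for k in range(k_min, k_max + 1))


def enumerate_subsets(variables, k_min, k_max):
    """Yield every candidate subset of variables, smallest sizes first."""
    for k in range(k_min, k_max + 1):
        yield from combinations(variables, k)


# 24 short-listed ratios, models with 5 to 14 independent variables
# (the sizes stated in the summary).
total = count_models(24, 5, 14)
print(total)  # 14185135

# Tiny demonstration with hypothetical ratio names: subsets of size 2 and 3
# from 4 variables. A real run would fit one LR model per yielded subset.
demo = list(enumerate_subsets(["r1", "r2", "r3", "r4"], 2, 3))
print(len(demo))  # 10
```

The count comes to 14,185,135, consistent with the summary's rounded figure. Because `enumerate_subsets` is a generator, the subsets are produced one at a time rather than held in memory together.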