Two credit scoring models based on dual strategy ensemble trees
Published in: | Knowledge-based systems 2012-02, Vol.26, p.61-68 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | ► Two dual strategy ensemble trees are proposed to reduce the influence of noisy data and redundant attributes. ► Comprehensive experimental evaluations are conducted to validate the effectiveness of the proposed methods. ► RS-Bagging DT and Bagging-RS DT can be used as alternative techniques for credit scoring.
The decision tree (DT) is one of the most popular classification algorithms in data mining and machine learning. However, DT-based credit scoring models often perform worse than models built with other techniques, mainly because in the credit scoring setting a DT is easily affected by (1) noisy data and (2) redundant attributes. In this study, we propose two dual strategy ensemble trees, RS-Bagging DT and Bagging-RS DT, which combine two ensemble strategies, bagging and random subspace, to reduce the influence of noisy data and redundant attributes and to achieve higher classification accuracy. Two real-world credit datasets are used to demonstrate the effectiveness and feasibility of the proposed methods. Experimental results show that the single DT has the lowest average accuracy among the five single classifiers compared, the other four being Logistic Regression Analysis (LRA), Linear Discriminant Analysis (LDA), Multi-layer Perceptron (MLP) and Radial Basis Function Network (RBFN). Moreover, RS-Bagging DT and Bagging-RS DT outperform the five single classifiers and four popular ensemble classifiers, i.e., Bagging DT, Random Subspace DT, Random Forest and Rotation Forest. These results indicate that RS-Bagging DT and Bagging-RS DT can be used as alternative techniques for credit scoring. |
---|---|
ISSN: | 0950-7051 1872-7409 |
DOI: | 10.1016/j.knosys.2011.06.020 |
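
The summary names two ensemble strategies, bagging (resampling rows) and random subspace (sampling attributes), and says they are combined in two orders. A minimal Python sketch of that idea follows. It is not the authors' implementation: the reading that "RS-Bagging" applies random subspace first and "Bagging-RS" applies bagging first is an assumption, as are the function names (`fit_stump`, `bootstrap`, `train_dual`), the parameters (`n_first`, `n_second`, `k`), the use of a one-split decision stump in place of a full decision tree, majority voting as the combiner, and the toy data.

```python
import random
from collections import Counter


def majority(labels):
    """Most common label (ties go to the label seen first)."""
    return Counter(labels).most_common(1)[0][0]


def fit_stump(X, y, features):
    """Fit a one-split stump using only the given feature indices."""
    fallback = majority(y)
    best = None  # (errors, feature, threshold, left_label, right_label)
    for f in features:
        for t in sorted({row[f] for row in X}):
            left = [lab for row, lab in zip(X, y) if row[f] <= t]
            right = [lab for row, lab in zip(X, y) if row[f] > t]
            left_label = majority(left)  # never empty: t is an observed value
            right_label = majority(right) if right else fallback
            errors = sum(lab != left_label for lab in left)
            errors += sum(lab != right_label for lab in right)
            if best is None or errors < best[0]:
                best = (errors, f, t, left_label, right_label)
    _, f, t, left_label, right_label = best
    return lambda row: left_label if row[f] <= t else right_label


def bootstrap(X, y, rng):
    """Bagging step: resample rows with replacement."""
    idx = [rng.randrange(len(X)) for _ in X]
    return [X[i] for i in idx], [y[i] for i in idx]


def train_dual(X, y, first, n_first=5, n_second=5, k=2, seed=0):
    """Two-level ensemble; `first` names the outer strategy ("rs" or "bagging")."""
    if first not in ("rs", "bagging"):
        raise ValueError("first must be 'rs' or 'bagging'")
    rng = random.Random(seed)
    all_features = range(len(X[0]))
    groups = []
    for _ in range(n_first):
        members = []
        if first == "rs":
            # Outer: fix a random feature subset; inner: bag rows within it.
            feats = rng.sample(all_features, k)
            for _ in range(n_second):
                Xb, yb = bootstrap(X, y, rng)
                members.append(fit_stump(Xb, yb, feats))
        else:
            # Outer: draw one bootstrap sample; inner: vary the feature subset.
            Xb, yb = bootstrap(X, y, rng)
            for _ in range(n_second):
                members.append(fit_stump(Xb, yb, rng.sample(all_features, k)))
        groups.append(members)

    def predict(row):
        # Majority vote inside each group, then across groups.
        return majority([majority([m(row) for m in g]) for g in groups])

    return predict


# Toy data (invented for illustration): feature 0 decides the label,
# features 1 and 2 are pure noise standing in for redundant attributes.
data_rng = random.Random(42)
X = [[data_rng.random() for _ in range(3)] for _ in range(40)]
y = [1 if row[0] > 0.5 else 0 for row in X]

rs_bagging = train_dual(X, y, first="rs")
bagging_rs = train_dual(X, y, first="bagging")
for name, model in (("rs then bagging", rs_bagging), ("bagging then rs", bagging_rs)):
    acc = sum(model(row) == lab for row, lab in zip(X, y)) / len(X)
    print(name, "training accuracy:", acc)
```

The printed figures are training accuracies on synthetic data and say nothing about the results reported in the article. They only show that both orderings run end to end.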