Loading…
Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. A correlated-adjusted decision forest proposal
•CADF deals with multiple diversity sources to balance accuracy and interpretability.•To enhance diversity, CADF merges different correlated-adjusted decision trees.•Results suggest CADF can compete in accuracy with much more complex ensemble models.•Superior accuracy of CADF is tested through diffe...
Saved in:
Published in: | Expert systems with applications 2015-08, Vol.42 (13), p.5737-5753 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c399t-8c083e1f6ad28e0b1ae928a9eef394aba0aa0dea1544a366953b59d04503e3853 |
---|---|
cites | cdi_FETCH-LOGICAL-c399t-8c083e1f6ad28e0b1ae928a9eef394aba0aa0dea1544a366953b59d04503e3853 |
container_end_page | 5753 |
container_issue | 13 |
container_start_page | 5737 |
container_title | Expert systems with applications |
container_volume | 42 |
creator | Florez-Lopez, Raquel Ramon-Jeronimo, Juan Manuel |
description | •CADF deals with multiple diversity sources to balance accuracy and interpretability.•To enhance diversity, CADF merges different correlated-adjusted decision trees.•Results suggest CADF can compete in accuracy with much more complex ensemble models.•Superior accuracy of CADF is tested through different measures and statistical tests.•Oppositely to ‘black-box’ models, CADF produces logical, human understanding rules.
Credit risk assessment is a critical topic for finance activity and bankruptcy prediction that has been broadly explored using statistical models and Machine Learning methods. Recently, studies have suggested the use of ensemble strategies to enhance credit modelling performance. However, accuracy is obtained at the expense of interpretability, leading to the reluctance of financial industry to employ ensemble models in favour of simpler models. In this work we introduce an ensemble approach based on merged decision trees, the correlated-adjusted decision forest (CADF), to produce both accurate and comprehensible models. As main innovation, our proposal explores the combination of complementary sources of diversity as mechanisms to optimise model’s structure, which leads to a manageable number of comprehensive decision rules without sacrificing performance. We evaluate our approach in comparison to individual classifiers and alternative ensemble strategies (gradient boosting, random forests). Empirical results suggest CADF is an encouraging solution for credit risk problems, being able to compete in accuracy with much complex proposals while producing a rule-based structure directly useful for managerial decisions. |
doi_str_mv | 10.1016/j.eswa.2015.02.042 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1825458095</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417415001499</els_id><sourcerecordid>1825458095</sourcerecordid><originalsourceid>FETCH-LOGICAL-c399t-8c083e1f6ad28e0b1ae928a9eef394aba0aa0dea1544a366953b59d04503e3853</originalsourceid><addsrcrecordid>eNp9kL-O1DAQhy0EEsvBC1C5pEkYx3ESSzSn03EgnUQDtTWxJ4eXrLN4vKB9Ax4br5aaaqb4fvPnE-KtglaBGt7vW-Lf2HagTAtdC333TOzUNOpmGK1-LnZgzdj0auxfilfMewA1Aow78ec-fcfkY3qS6P0poz9LTEHGVCgfMxWc4xrLWW6LpMR0mFeSXDIWeorElZM-U4hF5sg_JDIT84FSaeWt9FvOtFY0NBj2J66NDOQjxy3JZcvERR7zdtwY19fixYIr05t_9UZ8-3j_9e5T8_jl4fPd7WPjtbWlmTxMmtQyYOgmglkh2W5CS7Ro2-OMgAiBUJm-Rz0M1ujZ2AC9AU16MvpGvLvOrYt_nuoF7hDZ07piou3ETk2d6c1UfVW0u6I-b8yZFnfM8YD57BS4i3a3dxft7qLdQeeq9hr6cA1RfeJXpOzYR0q-Osrkiwtb_F_8L1-Fj3E</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1825458095</pqid></control><display><type>article</type><title>Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. A correlated-adjusted decision forest proposal</title><source>ScienceDirect Freedom Collection 2022-2024</source><creator>Florez-Lopez, Raquel ; Ramon-Jeronimo, Juan Manuel</creator><creatorcontrib>Florez-Lopez, Raquel ; Ramon-Jeronimo, Juan Manuel</creatorcontrib><description>•CADF deals with multiple diversity sources to balance accuracy and interpretability.•To enhance diversity, CADF merges different correlated-adjusted decision trees.•Results suggest CADF can compete in accuracy with much more complex ensemble models.•Superior accuracy of CADF is tested through different measures and statistical tests.•Oppositely to ‘black-box’ models, CADF produces logical, human understanding rules.
Credit risk assessment is a critical topic for finance activity and bankruptcy prediction that has been broadly explored using statistical models and Machine Learning methods. Recently, studies have suggested the use of ensemble strategies to enhance credit modelling performance. However, accuracy is obtained at the expense of interpretability, leading to the reluctance of financial industry to employ ensemble models in favour of simpler models. In this work we introduce an ensemble approach based on merged decision trees, the correlated-adjusted decision forest (CADF), to produce both accurate and comprehensible models. As main innovation, our proposal explores the combination of complementary sources of diversity as mechanisms to optimise model’s structure, which leads to a manageable number of comprehensive decision rules without sacrificing performance. We evaluate our approach in comparison to individual classifiers and alternative ensemble strategies (gradient boosting, random forests). Empirical results suggest CADF is an encouraging solution for credit risk problems, being able to compete in accuracy with much complex proposals while producing a rule-based structure directly useful for managerial decisions.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2015.02.042</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Accuracy ; Correlation ; Credit scoring ; Decision forests ; Decision trees ; Diversity ; Ensemble strategies ; Gradient boosting ; Mathematical models ; Proposals ; Random forests ; Reluctance ; Risk assessment ; Strategy</subject><ispartof>Expert systems with applications, 2015-08, Vol.42 (13), p.5737-5753</ispartof><rights>2015 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c399t-8c083e1f6ad28e0b1ae928a9eef394aba0aa0dea1544a366953b59d04503e3853</citedby><cites>FETCH-LOGICAL-c399t-8c083e1f6ad28e0b1ae928a9eef394aba0aa0dea1544a366953b59d04503e3853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Florez-Lopez, Raquel</creatorcontrib><creatorcontrib>Ramon-Jeronimo, Juan Manuel</creatorcontrib><title>Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. A correlated-adjusted decision forest proposal</title><title>Expert systems with applications</title><description>•CADF deals with multiple diversity sources to balance accuracy and interpretability.•To enhance diversity, CADF merges different correlated-adjusted decision trees.•Results suggest CADF can compete in accuracy with much more complex ensemble models.•Superior accuracy of CADF is tested through different measures and statistical tests.•Oppositely to ‘black-box’ models, CADF produces logical, human understanding rules.
Credit risk assessment is a critical topic for finance activity and bankruptcy prediction that has been broadly explored using statistical models and Machine Learning methods. Recently, studies have suggested the use of ensemble strategies to enhance credit modelling performance. However, accuracy is obtained at the expense of interpretability, leading to the reluctance of financial industry to employ ensemble models in favour of simpler models. In this work we introduce an ensemble approach based on merged decision trees, the correlated-adjusted decision forest (CADF), to produce both accurate and comprehensible models. As main innovation, our proposal explores the combination of complementary sources of diversity as mechanisms to optimise model’s structure, which leads to a manageable number of comprehensive decision rules without sacrificing performance. We evaluate our approach in comparison to individual classifiers and alternative ensemble strategies (gradient boosting, random forests). Empirical results suggest CADF is an encouraging solution for credit risk problems, being able to compete in accuracy with much complex proposals while producing a rule-based structure directly useful for managerial decisions.</description><subject>Accuracy</subject><subject>Correlation</subject><subject>Credit scoring</subject><subject>Decision forests</subject><subject>Decision trees</subject><subject>Diversity</subject><subject>Ensemble strategies</subject><subject>Gradient boosting</subject><subject>Mathematical models</subject><subject>Proposals</subject><subject>Random forests</subject><subject>Reluctance</subject><subject>Risk assessment</subject><subject>Strategy</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNp9kL-O1DAQhy0EEsvBC1C5pEkYx3ESSzSn03EgnUQDtTWxJ4eXrLN4vKB9Ax4br5aaaqb4fvPnE-KtglaBGt7vW-Lf2HagTAtdC333TOzUNOpmGK1-LnZgzdj0auxfilfMewA1Aow78ec-fcfkY3qS6P0poz9LTEHGVCgfMxWc4xrLWW6LpMR0mFeSXDIWeorElZM-U4hF5sg_JDIT84FSaeWt9FvOtFY0NBj2J66NDOQjxy3JZcvERR7zdtwY19fixYIr05t_9UZ8-3j_9e5T8_jl4fPd7WPjtbWlmTxMmtQyYOgmglkh2W5CS7Ro2-OMgAiBUJm-Rz0M1ujZ2AC9AU16MvpGvLvOrYt_nuoF7hDZ07piou3ETk2d6c1UfVW0u6I-b8yZFnfM8YD57BS4i3a3dxft7qLdQeeq9hr6cA1RfeJXpOzYR0q-Osrkiwtb_F_8L1-Fj3E</recordid><startdate>20150801</startdate><enddate>20150801</enddate><creator>Florez-Lopez, Raquel</creator><creator>Ramon-Jeronimo, Juan Manuel</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20150801</creationdate><title>Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. A correlated-adjusted decision forest proposal</title><author>Florez-Lopez, Raquel ; Ramon-Jeronimo, Juan Manuel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c399t-8c083e1f6ad28e0b1ae928a9eef394aba0aa0dea1544a366953b59d04503e3853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Accuracy</topic><topic>Correlation</topic><topic>Credit scoring</topic><topic>Decision forests</topic><topic>Decision trees</topic><topic>Diversity</topic><topic>Ensemble strategies</topic><topic>Gradient boosting</topic><topic>Mathematical models</topic><topic>Proposals</topic><topic>Random forests</topic><topic>Reluctance</topic><topic>Risk assessment</topic><topic>Strategy</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Florez-Lopez, Raquel</creatorcontrib><creatorcontrib>Ramon-Jeronimo, Juan Manuel</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Florez-Lopez, Raquel</au><au>Ramon-Jeronimo, Juan Manuel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. A correlated-adjusted decision forest proposal</atitle><jtitle>Expert systems with applications</jtitle><date>2015-08-01</date><risdate>2015</risdate><volume>42</volume><issue>13</issue><spage>5737</spage><epage>5753</epage><pages>5737-5753</pages><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•CADF deals with multiple diversity sources to balance accuracy and interpretability.•To enhance diversity, CADF merges different correlated-adjusted decision trees.•Results suggest CADF can compete in accuracy with much more complex ensemble models.•Superior accuracy of CADF is tested through different measures and statistical tests.•Oppositely to ‘black-box’ models, CADF produces logical, human understanding rules.
Credit risk assessment is a critical topic for finance activity and bankruptcy prediction that has been broadly explored using statistical models and Machine Learning methods. Recently, studies have suggested the use of ensemble strategies to enhance credit modelling performance. However, accuracy is obtained at the expense of interpretability, leading to the reluctance of financial industry to employ ensemble models in favour of simpler models. In this work we introduce an ensemble approach based on merged decision trees, the correlated-adjusted decision forest (CADF), to produce both accurate and comprehensible models. As main innovation, our proposal explores the combination of complementary sources of diversity as mechanisms to optimise model’s structure, which leads to a manageable number of comprehensive decision rules without sacrificing performance. We evaluate our approach in comparison to individual classifiers and alternative ensemble strategies (gradient boosting, random forests). Empirical results suggest CADF is an encouraging solution for credit risk problems, being able to compete in accuracy with much complex proposals while producing a rule-based structure directly useful for managerial decisions.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2015.02.042</doi><tpages>17</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2015-08, Vol.42 (13), p.5737-5753 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_miscellaneous_1825458095 |
source | ScienceDirect Freedom Collection 2022-2024 |
subjects | Accuracy Correlation Credit scoring Decision forests Decision trees Diversity Ensemble strategies Gradient boosting Mathematical models Proposals Random forests Reluctance Risk assessment Strategy |
title | Enhancing accuracy and interpretability of ensemble strategies in credit risk assessment. A correlated-adjusted decision forest proposal |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T02%3A28%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Enhancing%20accuracy%20and%20interpretability%20of%20ensemble%20strategies%20in%20credit%20risk%20assessment.%20A%20correlated-adjusted%20decision%20forest%20proposal&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Florez-Lopez,%20Raquel&rft.date=2015-08-01&rft.volume=42&rft.issue=13&rft.spage=5737&rft.epage=5753&rft.pages=5737-5753&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2015.02.042&rft_dat=%3Cproquest_cross%3E1825458095%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c399t-8c083e1f6ad28e0b1ae928a9eef394aba0aa0dea1544a366953b59d04503e3853%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1825458095&rft_id=info:pmid/&rfr_iscdi=true |