Loading…

Credit Risk Prediction Based on Improved ADASYN Sampling and Optimized LightGBM

A credit risk prediction model named KM-ADASYN-TL-FLLightGBM (KADT-FLightGBM) is proposed in this study. Firstly, to overcome the limitation of traditional sampling methods in dealing with imbalanced datasets, an improved ADASYN sampling with K-means clustering algorithm is constructed. Moreover, th...

Full description

Saved in:
Bibliographic Details
Published in:Journal of social computing 2024-09, Vol.5 (3), p.232-241
Main Authors: Song, Mei, Ma, He, Zhu, Yi, Zhang, Mengdi
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A credit risk prediction model named KM-ADASYN-TL-FLLightGBM (KADT-FLightGBM) is proposed in this study. Firstly, to overcome the limitation of traditional sampling methods in dealing with imbalanced datasets, an improved ADASYN sampling with K-means clustering algorithm is constructed. Moreover, the Tomek Links method is used to filter the generated samples. Secondly, an utilized an optimized LightGBM algorithm with the Focal Loss is employed to training the model using the datasets obtained by the improved ADASYN sampling. Finally, the comparative analysis between the ensemble model and other different sampling methodologies is conducted on the Lending Club dataset. The results demonstrate that the proposed model effectively minimizes the misclassification of minority classes in credit risk prediction and can be used as a reference for similar studies.
ISSN:2688-5255
2688-5255
DOI:10.23919/JSC.2024.0019