Loading…
Credit Risk Prediction Based on Improved ADASYN Sampling and Optimized LightGBM
A credit risk prediction model named KM-ADASYN-TL-FLLightGBM (KADT-FLightGBM) is proposed in this study. Firstly, to overcome the limitation of traditional sampling methods in dealing with imbalanced datasets, an improved ADASYN sampling with K-means clustering algorithm is constructed. Moreover, th...
Saved in:
Published in: | Journal of social computing 2024-09, Vol.5 (3), p.232-241 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | A credit risk prediction model named KM-ADASYN-TL-FLLightGBM (KADT-FLightGBM) is proposed in this study. Firstly, to overcome the limitation of traditional sampling methods in dealing with imbalanced datasets, an improved ADASYN sampling with K-means clustering algorithm is constructed. Moreover, the Tomek Links method is used to filter the generated samples. Secondly, an utilized an optimized LightGBM algorithm with the Focal Loss is employed to training the model using the datasets obtained by the improved ADASYN sampling. Finally, the comparative analysis between the ensemble model and other different sampling methodologies is conducted on the Lending Club dataset. The results demonstrate that the proposed model effectively minimizes the misclassification of minority classes in credit risk prediction and can be used as a reference for similar studies. |
---|---|
ISSN: | 2688-5255 2688-5255 |
DOI: | 10.23919/JSC.2024.0019 |