Loading…

Using machine learning models to identify the risk of depression in middle-aged and older adults with frequent and infrequent nicotine use: A cross-sectional study

Depression is very prevalent in middle-aged and older smokers. Therefore, we aimed to identify the risk of depression among middle-aged and older adults with frequent and infrequent nicotine use, as this is quite necessary for supporting their well-being. This study included a total of 10,821 partic...

Full description

Saved in:
Bibliographic Details
Published in:Journal of affective disorders 2024-12, Vol.367, p.554-561
Main Authors: Qiu, Yuran, Ma, Xu
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Depression is very prevalent in middle-aged and older smokers. Therefore, we aimed to identify the risk of depression among middle-aged and older adults with frequent and infrequent nicotine use, as this is quite necessary for supporting their well-being. This study included a total of 10,821 participants, which were derived from the China Health and Retirement Longitudinal Study Wave 5, 2020 (CHARLS-5). Five machine learning (ML) algorithms were employed. Some metrics were used to evaluate the performance of models, including area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), specificity, accuracy. 10,821 participants (6472 males, 4349 females) had a mean age of 60.47 ± 8.98, with a score of 8.90 ± 6.53 on depression scale. For middle-aged and older adults with frequent nicotine use, random forest (RF) achieved the highest AUC value, PPV and specificity (0.75, 0.74 and 0.88, respectively). For the other group, support vector machines (SVM) showed the highest PPV (0.74), and relatively high accuracy and specificity (0.72 and 0.87, respectively). Feature importance analysis indicated that “dissatisfaction with life” was the most important variable of identifying the risk of depression in the SVM model, while “attitude towards expected life span” was the most important one in the RF model. CHARLS-5 was collected during the COVID-19, so our results may be influenced by the pandemic. This study indicated that certain ML models can ideally identify the risk of depression in middle-aged and older adults, which holds significant value for their health management. •To identify the risk of depression in middle-aged and older adults with frequent and infrequent nicotine use.•Random forest and support vector machines are identified as the two best performing models.•Variables affecting the two models' output differ between two groups.
ISSN:0165-0327
1573-2517
1573-2517
DOI:10.1016/j.jad.2024.08.185