Loading…

Predicting the depression in university students using stacking ensemble techniques over oversampling method

Depression is that mental health disorder characterized by constant sadness for approximately 2 weeks, in which it generates an inability to do daily activities, and those affected lose interest in doing the things they previously enjoyed. About 1 billion people have mental disorders and more than 3...

Full description

Saved in:
Bibliographic Details
Published in:Informatics in medicine unlocked 2023, Vol.41, p.101295, Article 101295
Main Authors: Daza Vergaray, Alfredo, Miranda, Juan Carlos Herrera, Cornelio, Juana Bobadilla, López Carranza, Atilio Rubén, Ponce Sánchez, Carlos Fidel
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Depression is that mental health disorder characterized by constant sadness for approximately 2 weeks, in which it generates an inability to do daily activities, and those affected lose interest in doing the things they previously enjoyed. About 1 billion people have mental disorders and more than 300 million people have depression globally. To predict depression, the use of machine learning techniques is essential, being helpful in obtaining automatic processes and creating models that help analyze and solve a problem. The objective of the study was to propose a method and 3 combined models based on Stacking to predict depression in university students of a public university. The dataset was composed of Computer and Systems Engineering students from a public university (n = 284). Then cleaning and pre-processing was performed, where the data was reviewed using the Python program. In the balancing of the data, the data were divided into 5 values obtained and the oversampling method was performed, distributing the data according to the condition. Then we proceeded to partition the balanced data, while using the Cross validation method for data training. For the model and evaluation, 4 independent algorithms were used, and based on these 3 combined models were proposed. Of the proposed combined models Ensemble Stacking 1 and Stacking 2 achieved the best Accuracy and ROC Curve score -micro and score-macro with 94.69% and 100.00%. In the same way with respect to sensitivity, Stacking 1 obtained the best sensitivity, accuracy and F1-Score, these being 94.22%, 94.09% and 94.12% respectively. This study emphasizes the application of the Ensemble Stacking method to detect depression early in students of a public university in Peru. With this technology, when using the combined method, it was possible to observe an improvement in the performance of the process for the prediction of depression, unlike performing it with independent algorithms.
ISSN:2352-9148
2352-9148
DOI:10.1016/j.imu.2023.101295