Loading…

Machine learning models for prediction of double and triple burdens of non-communicable diseases in Bangladesh

Increasing prevalence of non-communicable diseases (NCDs) has become the leading cause of death and disability in Bangladesh. Therefore, this study aimed to measure the prevalence of and risk factors for double and triple burden of NCDs (DBNCDs and TBNCDs), considering diabetes, hypertension, and ov...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biosocial science 2024-05, Vol.56 (3), p.426-444
Main Authors: Al-Zubayer, Md Akib, Alam, Khorshed, Shanto, Hasibul Hasan, Maniruzzaman, Md, Majumder, Uttam Kumar, Ahammed, Benojir
Format: Article
Language:English
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c253t-ad7a9d294565a765c2eb5cbfcf2ec90606dacf8309322892333135065d9123c73
container_end_page 444
container_issue 3
container_start_page 426
container_title Journal of biosocial science
container_volume 56
creator Al-Zubayer, Md Akib
Alam, Khorshed
Shanto, Hasibul Hasan
Maniruzzaman, Md
Majumder, Uttam Kumar
Ahammed, Benojir
description Increasing prevalence of non-communicable diseases (NCDs) has become the leading cause of death and disability in Bangladesh. Therefore, this study aimed to measure the prevalence of and risk factors for double and triple burden of NCDs (DBNCDs and TBNCDs), considering diabetes, hypertension, and overweight and obesity as well as establish a machine learning approach for predicting DBNCDs and TBNCDs. A total of 12,151 respondents from the 2017 to 2018 Bangladesh Demographic and Health Survey were included in this analysis, where 10%, 27.4%, and 24.3% of respondents had diabetes, hypertension, and overweight and obesity, respectively. Chi-square test and multilevel logistic regression (LR) analysis were applied to select factors associated with DBNCDs and TBNCDs. Furthermore, six classifiers including decision tree (DT), LR, naïve Bayes (NB), k-nearest neighbour (KNN), random forest (RF), and extreme gradient boosting (XGBoost) with three cross-validation protocols (K2, K5, and K10) were adopted to predict the status of DBNCDs and TBNCDs. The classification accuracy (ACC) and area under the curve (AUC) were computed for each protocol and repeated 10 times to make them more robust, and then the average ACC and AUC were computed. The prevalence of DBNCDs and TBNCDs was 14.3% and 2.3%, respectively. The findings of this study revealed that DBNCDs and TBNCDs were significantly influenced by age, sex, marital status, wealth index, education and geographic region. Compared to other classifiers, the RF-based classifier provides the highest ACC and AUC for both DBNCDs (ACC = 81.06% and AUC = 0.93) and TBNCDs (ACC = 88.61% and AUC = 0.97) for the K10 protocol. A combination of considered two-step factor selections and RF-based classifier can better predict the burden of NCDs. The findings of this study suggested that decision-makers might adopt suitable decisions to control and prevent the burden of NCDs using RF classifiers.
doi_str_mv 10.1017/S0021932024000063
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2972704011</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2972704011</sourcerecordid><originalsourceid>FETCH-LOGICAL-c253t-ad7a9d294565a765c2eb5cbfcf2ec90606dacf8309322892333135065d9123c73</originalsourceid><addsrcrecordid>eNplkD9PwzAQxS0EoqXwAViQR5bA2Y6TegTEP6mIAZgjx760Rokd7Gbg25OohYVb7qTfe0-6R8g5gysGrLx-A-BMCQ48h3EKcUDmLC9UVkqlDsl8wtnEZ-QkpU8AJkDJYzITSwlSCTUn_kWbjfNIW9TRO7-mXbDYJtqESPuI1pmtC56Ghtow1C1S7S3dRtePZz1Eiz5N0AefmdB1g3dGTzLrEuqEiTpPb7Vft9pi2pySo0a3Cc_2e0E-Hu7f756y1evj893NKjNcim2mbamV5SqXhdRlIQ3HWpq6MQ1Ho6CAwmrTLMdvBOdLxYUQTEgopFWMC1OKBbnc5fYxfA2YtlXnksG21R7DkCquSl5CDoyNUraTmhhSithUfXSdjt8Vg2qqufpX8-i52McPdYf2z_Hbq_gBqgN4CQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2972704011</pqid></control><display><type>article</type><title>Machine learning models for prediction of double and triple burdens of non-communicable diseases in Bangladesh</title><source>Cambridge Journals Online</source><creator>Al-Zubayer, Md Akib ; Alam, Khorshed ; Shanto, Hasibul Hasan ; Maniruzzaman, Md ; Majumder, Uttam Kumar ; Ahammed, Benojir</creator><creatorcontrib>Al-Zubayer, Md Akib ; Alam, Khorshed ; Shanto, Hasibul Hasan ; Maniruzzaman, Md ; Majumder, Uttam Kumar ; Ahammed, Benojir</creatorcontrib><description>Increasing prevalence of non-communicable diseases (NCDs) has become the leading cause of death and disability in Bangladesh. Therefore, this study aimed to measure the prevalence of and risk factors for double and triple burden of NCDs (DBNCDs and TBNCDs), considering diabetes, hypertension, and overweight and obesity as well as establish a machine learning approach for predicting DBNCDs and TBNCDs. A total of 12,151 respondents from the 2017 to 2018 Bangladesh Demographic and Health Survey were included in this analysis, where 10%, 27.4%, and 24.3% of respondents had diabetes, hypertension, and overweight and obesity, respectively. Chi-square test and multilevel logistic regression (LR) analysis were applied to select factors associated with DBNCDs and TBNCDs. Furthermore, six classifiers including decision tree (DT), LR, naïve Bayes (NB), k-nearest neighbour (KNN), random forest (RF), and extreme gradient boosting (XGBoost) with three cross-validation protocols (K2, K5, and K10) were adopted to predict the status of DBNCDs and TBNCDs. The classification accuracy (ACC) and area under the curve (AUC) were computed for each protocol and repeated 10 times to make them more robust, and then the average ACC and AUC were computed. The prevalence of DBNCDs and TBNCDs was 14.3% and 2.3%, respectively. The findings of this study revealed that DBNCDs and TBNCDs were significantly influenced by age, sex, marital status, wealth index, education and geographic region. Compared to other classifiers, the RF-based classifier provides the highest ACC and AUC for both DBNCDs (ACC = 81.06% and AUC = 0.93) and TBNCDs (ACC = 88.61% and AUC = 0.97) for the K10 protocol. A combination of considered two-step factor selections and RF-based classifier can better predict the burden of NCDs. The findings of this study suggested that decision-makers might adopt suitable decisions to control and prevent the burden of NCDs using RF classifiers.</description><identifier>ISSN: 0021-9320</identifier><identifier>EISSN: 1469-7599</identifier><identifier>DOI: 10.1017/S0021932024000063</identifier><identifier>PMID: 38505939</identifier><language>eng</language><publisher>England</publisher><ispartof>Journal of biosocial science, 2024-05, Vol.56 (3), p.426-444</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c253t-ad7a9d294565a765c2eb5cbfcf2ec90606dacf8309322892333135065d9123c73</cites><orcidid>0000-0003-2232-0745 ; 0000-0001-8208-9579 ; 0000-0001-7127-1869 ; 0000-0001-6660-047X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38505939$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Al-Zubayer, Md Akib</creatorcontrib><creatorcontrib>Alam, Khorshed</creatorcontrib><creatorcontrib>Shanto, Hasibul Hasan</creatorcontrib><creatorcontrib>Maniruzzaman, Md</creatorcontrib><creatorcontrib>Majumder, Uttam Kumar</creatorcontrib><creatorcontrib>Ahammed, Benojir</creatorcontrib><title>Machine learning models for prediction of double and triple burdens of non-communicable diseases in Bangladesh</title><title>Journal of biosocial science</title><addtitle>J Biosoc Sci</addtitle><description>Increasing prevalence of non-communicable diseases (NCDs) has become the leading cause of death and disability in Bangladesh. Therefore, this study aimed to measure the prevalence of and risk factors for double and triple burden of NCDs (DBNCDs and TBNCDs), considering diabetes, hypertension, and overweight and obesity as well as establish a machine learning approach for predicting DBNCDs and TBNCDs. A total of 12,151 respondents from the 2017 to 2018 Bangladesh Demographic and Health Survey were included in this analysis, where 10%, 27.4%, and 24.3% of respondents had diabetes, hypertension, and overweight and obesity, respectively. Chi-square test and multilevel logistic regression (LR) analysis were applied to select factors associated with DBNCDs and TBNCDs. Furthermore, six classifiers including decision tree (DT), LR, naïve Bayes (NB), k-nearest neighbour (KNN), random forest (RF), and extreme gradient boosting (XGBoost) with three cross-validation protocols (K2, K5, and K10) were adopted to predict the status of DBNCDs and TBNCDs. The classification accuracy (ACC) and area under the curve (AUC) were computed for each protocol and repeated 10 times to make them more robust, and then the average ACC and AUC were computed. The prevalence of DBNCDs and TBNCDs was 14.3% and 2.3%, respectively. The findings of this study revealed that DBNCDs and TBNCDs were significantly influenced by age, sex, marital status, wealth index, education and geographic region. Compared to other classifiers, the RF-based classifier provides the highest ACC and AUC for both DBNCDs (ACC = 81.06% and AUC = 0.93) and TBNCDs (ACC = 88.61% and AUC = 0.97) for the K10 protocol. A combination of considered two-step factor selections and RF-based classifier can better predict the burden of NCDs. The findings of this study suggested that decision-makers might adopt suitable decisions to control and prevent the burden of NCDs using RF classifiers.</description><issn>0021-9320</issn><issn>1469-7599</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNplkD9PwzAQxS0EoqXwAViQR5bA2Y6TegTEP6mIAZgjx760Rokd7Gbg25OohYVb7qTfe0-6R8g5gysGrLx-A-BMCQ48h3EKcUDmLC9UVkqlDsl8wtnEZ-QkpU8AJkDJYzITSwlSCTUn_kWbjfNIW9TRO7-mXbDYJtqESPuI1pmtC56Ghtow1C1S7S3dRtePZz1Eiz5N0AefmdB1g3dGTzLrEuqEiTpPb7Vft9pi2pySo0a3Cc_2e0E-Hu7f756y1evj893NKjNcim2mbamV5SqXhdRlIQ3HWpq6MQ1Ho6CAwmrTLMdvBOdLxYUQTEgopFWMC1OKBbnc5fYxfA2YtlXnksG21R7DkCquSl5CDoyNUraTmhhSithUfXSdjt8Vg2qqufpX8-i52McPdYf2z_Hbq_gBqgN4CQ</recordid><startdate>20240501</startdate><enddate>20240501</enddate><creator>Al-Zubayer, Md Akib</creator><creator>Alam, Khorshed</creator><creator>Shanto, Hasibul Hasan</creator><creator>Maniruzzaman, Md</creator><creator>Majumder, Uttam Kumar</creator><creator>Ahammed, Benojir</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-2232-0745</orcidid><orcidid>https://orcid.org/0000-0001-8208-9579</orcidid><orcidid>https://orcid.org/0000-0001-7127-1869</orcidid><orcidid>https://orcid.org/0000-0001-6660-047X</orcidid></search><sort><creationdate>20240501</creationdate><title>Machine learning models for prediction of double and triple burdens of non-communicable diseases in Bangladesh</title><author>Al-Zubayer, Md Akib ; Alam, Khorshed ; Shanto, Hasibul Hasan ; Maniruzzaman, Md ; Majumder, Uttam Kumar ; Ahammed, Benojir</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c253t-ad7a9d294565a765c2eb5cbfcf2ec90606dacf8309322892333135065d9123c73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Al-Zubayer, Md Akib</creatorcontrib><creatorcontrib>Alam, Khorshed</creatorcontrib><creatorcontrib>Shanto, Hasibul Hasan</creatorcontrib><creatorcontrib>Maniruzzaman, Md</creatorcontrib><creatorcontrib>Majumder, Uttam Kumar</creatorcontrib><creatorcontrib>Ahammed, Benojir</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of biosocial science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Al-Zubayer, Md Akib</au><au>Alam, Khorshed</au><au>Shanto, Hasibul Hasan</au><au>Maniruzzaman, Md</au><au>Majumder, Uttam Kumar</au><au>Ahammed, Benojir</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Machine learning models for prediction of double and triple burdens of non-communicable diseases in Bangladesh</atitle><jtitle>Journal of biosocial science</jtitle><addtitle>J Biosoc Sci</addtitle><date>2024-05-01</date><risdate>2024</risdate><volume>56</volume><issue>3</issue><spage>426</spage><epage>444</epage><pages>426-444</pages><issn>0021-9320</issn><eissn>1469-7599</eissn><abstract>Increasing prevalence of non-communicable diseases (NCDs) has become the leading cause of death and disability in Bangladesh. Therefore, this study aimed to measure the prevalence of and risk factors for double and triple burden of NCDs (DBNCDs and TBNCDs), considering diabetes, hypertension, and overweight and obesity as well as establish a machine learning approach for predicting DBNCDs and TBNCDs. A total of 12,151 respondents from the 2017 to 2018 Bangladesh Demographic and Health Survey were included in this analysis, where 10%, 27.4%, and 24.3% of respondents had diabetes, hypertension, and overweight and obesity, respectively. Chi-square test and multilevel logistic regression (LR) analysis were applied to select factors associated with DBNCDs and TBNCDs. Furthermore, six classifiers including decision tree (DT), LR, naïve Bayes (NB), k-nearest neighbour (KNN), random forest (RF), and extreme gradient boosting (XGBoost) with three cross-validation protocols (K2, K5, and K10) were adopted to predict the status of DBNCDs and TBNCDs. The classification accuracy (ACC) and area under the curve (AUC) were computed for each protocol and repeated 10 times to make them more robust, and then the average ACC and AUC were computed. The prevalence of DBNCDs and TBNCDs was 14.3% and 2.3%, respectively. The findings of this study revealed that DBNCDs and TBNCDs were significantly influenced by age, sex, marital status, wealth index, education and geographic region. Compared to other classifiers, the RF-based classifier provides the highest ACC and AUC for both DBNCDs (ACC = 81.06% and AUC = 0.93) and TBNCDs (ACC = 88.61% and AUC = 0.97) for the K10 protocol. A combination of considered two-step factor selections and RF-based classifier can better predict the burden of NCDs. The findings of this study suggested that decision-makers might adopt suitable decisions to control and prevent the burden of NCDs using RF classifiers.</abstract><cop>England</cop><pmid>38505939</pmid><doi>10.1017/S0021932024000063</doi><tpages>19</tpages><orcidid>https://orcid.org/0000-0003-2232-0745</orcidid><orcidid>https://orcid.org/0000-0001-8208-9579</orcidid><orcidid>https://orcid.org/0000-0001-7127-1869</orcidid><orcidid>https://orcid.org/0000-0001-6660-047X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0021-9320
ispartof Journal of biosocial science, 2024-05, Vol.56 (3), p.426-444
issn 0021-9320
1469-7599
language eng
recordid cdi_proquest_miscellaneous_2972704011
source Cambridge Journals Online
title Machine learning models for prediction of double and triple burdens of non-communicable diseases in Bangladesh
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T03%3A00%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Machine%20learning%20models%20for%20prediction%20of%20double%20and%20triple%20burdens%20of%20non-communicable%20diseases%20in%20Bangladesh&rft.jtitle=Journal%20of%20biosocial%20science&rft.au=Al-Zubayer,%20Md%20Akib&rft.date=2024-05-01&rft.volume=56&rft.issue=3&rft.spage=426&rft.epage=444&rft.pages=426-444&rft.issn=0021-9320&rft.eissn=1469-7599&rft_id=info:doi/10.1017/S0021932024000063&rft_dat=%3Cproquest_cross%3E2972704011%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c253t-ad7a9d294565a765c2eb5cbfcf2ec90606dacf8309322892333135065d9123c73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2972704011&rft_id=info:pmid/38505939&rfr_iscdi=true