Loading…

Machine learning–based outcome prediction and novel hypotheses generation for substance use disorder treatment

Abstract Objective Substance use disorder is a critical public health issue. Discovering the synergies among factors impacting treatment program success can help governments and treatment facilities develop effective policies. In this work, we propose a novel data analytics approach using machine le...

Full description

Saved in:
Bibliographic Details
Published in:Journal of the American Medical Informatics Association : JAMIA 2021-06, Vol.28 (6), p.1216-1224
Main Authors: Nasir, Murtaza, Summerfield, Nichalin S, Oztekin, Asil, Knight, Margaret, Ackerson, Leland K, Carreiro, Stephanie
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c420t-77b9233464c433785f10732cbb43f9d0bce2b3c5af5d9263f71e5da06c0138f73
cites cdi_FETCH-LOGICAL-c420t-77b9233464c433785f10732cbb43f9d0bce2b3c5af5d9263f71e5da06c0138f73
container_end_page 1224
container_issue 6
container_start_page 1216
container_title Journal of the American Medical Informatics Association : JAMIA
container_volume 28
creator Nasir, Murtaza
Summerfield, Nichalin S
Oztekin, Asil
Knight, Margaret
Ackerson, Leland K
Carreiro, Stephanie
description Abstract Objective Substance use disorder is a critical public health issue. Discovering the synergies among factors impacting treatment program success can help governments and treatment facilities develop effective policies. In this work, we propose a novel data analytics approach using machine learning models to discover interaction effects that might be neglected by traditional hypothesis-generating approaches. Materials and Methods A patient-episode-level substance use treatment discharge dataset and a Federal Bureau of Investigation crime dataset were joined using core-based statistical area codes. Random forests, artificial neural networks, and extreme gradient boosting were applied with a nested cross-validation methodology. Interaction effects were identified based on the machine learning model with the best performance. These interaction effects were analyzed and tested using traditional logistic regression models on unseen data. Results In predicting patient completion of a treatment program, extreme gradient boosting performed the best with an area under the curve of 89.31%. Based on our procedure, 73 interaction effects were identified. Among these, 14 were tested using traditional logistic regression models where 12 were statistically significant (P
doi_str_mv 10.1093/jamia/ocaa350
format article
fullrecord <record><control><sourceid>oup_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8661425</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/jamia/ocaa350</oup_id><sourcerecordid>10.1093/jamia/ocaa350</sourcerecordid><originalsourceid>FETCH-LOGICAL-c420t-77b9233464c433785f10732cbb43f9d0bce2b3c5af5d9263f71e5da06c0138f73</originalsourceid><addsrcrecordid>eNqFkE1LxDAQhoMorl9Hr5Kjl2rSNE17EUT8AsWLgrcyTaa7kTYpSSvszf_gP_SXuO6qrCdPM_A-8wy8hBxydsJZKU5foLNw6jWAkGyD7HCZqqRU2fPm2j4huzG-MMbzVMhtMhFCKsazYof096Bn1iFtEYKzbvrx9l5DREP9OGjfIe0DGqsH6x0FZ6jzr9jS2bz3wwwjRjpFhwGWeeMDjWMdB3Aa6RiRGht9MBjoEBCGDt2wT7YaaCMefM898nR1-Xhxk9w9XN9enN8lOkvZkChVl6kQWZ7pTAhVyIYzJVJd15loSsNqjWkttIRGmjLNRaM4SgMs14yLolFij5ytvP1Yd2j04nWAtuqD7SDMKw-2-ps4O6um_rUq8pxnqVwIkpVABx9jwOb3lrPqq_pqWX31Xf2CP1p_-Ev_dL0AjleAH_t_XJ-DoJSI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Machine learning–based outcome prediction and novel hypotheses generation for substance use disorder treatment</title><source>Oxford Journals Online</source><source>PubMed Central</source><creator>Nasir, Murtaza ; Summerfield, Nichalin S ; Oztekin, Asil ; Knight, Margaret ; Ackerson, Leland K ; Carreiro, Stephanie</creator><creatorcontrib>Nasir, Murtaza ; Summerfield, Nichalin S ; Oztekin, Asil ; Knight, Margaret ; Ackerson, Leland K ; Carreiro, Stephanie</creatorcontrib><description>Abstract Objective Substance use disorder is a critical public health issue. Discovering the synergies among factors impacting treatment program success can help governments and treatment facilities develop effective policies. In this work, we propose a novel data analytics approach using machine learning models to discover interaction effects that might be neglected by traditional hypothesis-generating approaches. Materials and Methods A patient-episode-level substance use treatment discharge dataset and a Federal Bureau of Investigation crime dataset were joined using core-based statistical area codes. Random forests, artificial neural networks, and extreme gradient boosting were applied with a nested cross-validation methodology. Interaction effects were identified based on the machine learning model with the best performance. These interaction effects were analyzed and tested using traditional logistic regression models on unseen data. Results In predicting patient completion of a treatment program, extreme gradient boosting performed the best with an area under the curve of 89.31%. Based on our procedure, 73 interaction effects were identified. Among these, 14 were tested using traditional logistic regression models where 12 were statistically significant (P&lt;.05). Conclusions We identified new interaction effects among the length of stay, frequency of substance use, changes in self-help group attendance frequency, and other factors. This work provides insights into the interactions between factors impacting treatment completion. Further traditional statistical analysis can be employed by practitioners and policy makers to test the effects discovered by our novel machine learning approach.</description><identifier>ISSN: 1527-974X</identifier><identifier>ISSN: 1067-5027</identifier><identifier>EISSN: 1527-974X</identifier><identifier>DOI: 10.1093/jamia/ocaa350</identifier><identifier>PMID: 33570148</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Humans ; Logistic Models ; Machine Learning ; Neural Networks, Computer ; Prognosis ; Research and Applications ; Substance-Related Disorders - therapy</subject><ispartof>Journal of the American Medical Informatics Association : JAMIA, 2021-06, Vol.28 (6), p.1216-1224</ispartof><rights>The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com 2021</rights><rights>The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c420t-77b9233464c433785f10732cbb43f9d0bce2b3c5af5d9263f71e5da06c0138f73</citedby><cites>FETCH-LOGICAL-c420t-77b9233464c433785f10732cbb43f9d0bce2b3c5af5d9263f71e5da06c0138f73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8661425/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8661425/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,27923,27924,53790,53792</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33570148$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Nasir, Murtaza</creatorcontrib><creatorcontrib>Summerfield, Nichalin S</creatorcontrib><creatorcontrib>Oztekin, Asil</creatorcontrib><creatorcontrib>Knight, Margaret</creatorcontrib><creatorcontrib>Ackerson, Leland K</creatorcontrib><creatorcontrib>Carreiro, Stephanie</creatorcontrib><title>Machine learning–based outcome prediction and novel hypotheses generation for substance use disorder treatment</title><title>Journal of the American Medical Informatics Association : JAMIA</title><addtitle>J Am Med Inform Assoc</addtitle><description>Abstract Objective Substance use disorder is a critical public health issue. Discovering the synergies among factors impacting treatment program success can help governments and treatment facilities develop effective policies. In this work, we propose a novel data analytics approach using machine learning models to discover interaction effects that might be neglected by traditional hypothesis-generating approaches. Materials and Methods A patient-episode-level substance use treatment discharge dataset and a Federal Bureau of Investigation crime dataset were joined using core-based statistical area codes. Random forests, artificial neural networks, and extreme gradient boosting were applied with a nested cross-validation methodology. Interaction effects were identified based on the machine learning model with the best performance. These interaction effects were analyzed and tested using traditional logistic regression models on unseen data. Results In predicting patient completion of a treatment program, extreme gradient boosting performed the best with an area under the curve of 89.31%. Based on our procedure, 73 interaction effects were identified. Among these, 14 were tested using traditional logistic regression models where 12 were statistically significant (P&lt;.05). Conclusions We identified new interaction effects among the length of stay, frequency of substance use, changes in self-help group attendance frequency, and other factors. This work provides insights into the interactions between factors impacting treatment completion. Further traditional statistical analysis can be employed by practitioners and policy makers to test the effects discovered by our novel machine learning approach.</description><subject>Humans</subject><subject>Logistic Models</subject><subject>Machine Learning</subject><subject>Neural Networks, Computer</subject><subject>Prognosis</subject><subject>Research and Applications</subject><subject>Substance-Related Disorders - therapy</subject><issn>1527-974X</issn><issn>1067-5027</issn><issn>1527-974X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqFkE1LxDAQhoMorl9Hr5Kjl2rSNE17EUT8AsWLgrcyTaa7kTYpSSvszf_gP_SXuO6qrCdPM_A-8wy8hBxydsJZKU5foLNw6jWAkGyD7HCZqqRU2fPm2j4huzG-MMbzVMhtMhFCKsazYof096Bn1iFtEYKzbvrx9l5DREP9OGjfIe0DGqsH6x0FZ6jzr9jS2bz3wwwjRjpFhwGWeeMDjWMdB3Aa6RiRGht9MBjoEBCGDt2wT7YaaCMefM898nR1-Xhxk9w9XN9enN8lOkvZkChVl6kQWZ7pTAhVyIYzJVJd15loSsNqjWkttIRGmjLNRaM4SgMs14yLolFij5ytvP1Yd2j04nWAtuqD7SDMKw-2-ps4O6um_rUq8pxnqVwIkpVABx9jwOb3lrPqq_pqWX31Xf2CP1p_-Ev_dL0AjleAH_t_XJ-DoJSI</recordid><startdate>20210612</startdate><enddate>20210612</enddate><creator>Nasir, Murtaza</creator><creator>Summerfield, Nichalin S</creator><creator>Oztekin, Asil</creator><creator>Knight, Margaret</creator><creator>Ackerson, Leland K</creator><creator>Carreiro, Stephanie</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>5PM</scope></search><sort><creationdate>20210612</creationdate><title>Machine learning–based outcome prediction and novel hypotheses generation for substance use disorder treatment</title><author>Nasir, Murtaza ; Summerfield, Nichalin S ; Oztekin, Asil ; Knight, Margaret ; Ackerson, Leland K ; Carreiro, Stephanie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c420t-77b9233464c433785f10732cbb43f9d0bce2b3c5af5d9263f71e5da06c0138f73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Humans</topic><topic>Logistic Models</topic><topic>Machine Learning</topic><topic>Neural Networks, Computer</topic><topic>Prognosis</topic><topic>Research and Applications</topic><topic>Substance-Related Disorders - therapy</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nasir, Murtaza</creatorcontrib><creatorcontrib>Summerfield, Nichalin S</creatorcontrib><creatorcontrib>Oztekin, Asil</creatorcontrib><creatorcontrib>Knight, Margaret</creatorcontrib><creatorcontrib>Ackerson, Leland K</creatorcontrib><creatorcontrib>Carreiro, Stephanie</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of the American Medical Informatics Association : JAMIA</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nasir, Murtaza</au><au>Summerfield, Nichalin S</au><au>Oztekin, Asil</au><au>Knight, Margaret</au><au>Ackerson, Leland K</au><au>Carreiro, Stephanie</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Machine learning–based outcome prediction and novel hypotheses generation for substance use disorder treatment</atitle><jtitle>Journal of the American Medical Informatics Association : JAMIA</jtitle><addtitle>J Am Med Inform Assoc</addtitle><date>2021-06-12</date><risdate>2021</risdate><volume>28</volume><issue>6</issue><spage>1216</spage><epage>1224</epage><pages>1216-1224</pages><issn>1527-974X</issn><issn>1067-5027</issn><eissn>1527-974X</eissn><abstract>Abstract Objective Substance use disorder is a critical public health issue. Discovering the synergies among factors impacting treatment program success can help governments and treatment facilities develop effective policies. In this work, we propose a novel data analytics approach using machine learning models to discover interaction effects that might be neglected by traditional hypothesis-generating approaches. Materials and Methods A patient-episode-level substance use treatment discharge dataset and a Federal Bureau of Investigation crime dataset were joined using core-based statistical area codes. Random forests, artificial neural networks, and extreme gradient boosting were applied with a nested cross-validation methodology. Interaction effects were identified based on the machine learning model with the best performance. These interaction effects were analyzed and tested using traditional logistic regression models on unseen data. Results In predicting patient completion of a treatment program, extreme gradient boosting performed the best with an area under the curve of 89.31%. Based on our procedure, 73 interaction effects were identified. Among these, 14 were tested using traditional logistic regression models where 12 were statistically significant (P&lt;.05). Conclusions We identified new interaction effects among the length of stay, frequency of substance use, changes in self-help group attendance frequency, and other factors. This work provides insights into the interactions between factors impacting treatment completion. Further traditional statistical analysis can be employed by practitioners and policy makers to test the effects discovered by our novel machine learning approach.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>33570148</pmid><doi>10.1093/jamia/ocaa350</doi><tpages>9</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1527-974X
ispartof Journal of the American Medical Informatics Association : JAMIA, 2021-06, Vol.28 (6), p.1216-1224
issn 1527-974X
1067-5027
1527-974X
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8661425
source Oxford Journals Online; PubMed Central
subjects Humans
Logistic Models
Machine Learning
Neural Networks, Computer
Prognosis
Research and Applications
Substance-Related Disorders - therapy
title Machine learning–based outcome prediction and novel hypotheses generation for substance use disorder treatment
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T23%3A00%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-oup_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Machine%20learning%E2%80%93based%20outcome%20prediction%20and%20novel%20hypotheses%20generation%20for%20substance%20use%20disorder%20treatment&rft.jtitle=Journal%20of%20the%20American%20Medical%20Informatics%20Association%20:%20JAMIA&rft.au=Nasir,%20Murtaza&rft.date=2021-06-12&rft.volume=28&rft.issue=6&rft.spage=1216&rft.epage=1224&rft.pages=1216-1224&rft.issn=1527-974X&rft.eissn=1527-974X&rft_id=info:doi/10.1093/jamia/ocaa350&rft_dat=%3Coup_pubme%3E10.1093/jamia/ocaa350%3C/oup_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c420t-77b9233464c433785f10732cbb43f9d0bce2b3c5af5d9263f71e5da06c0138f73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/33570148&rft_oup_id=10.1093/jamia/ocaa350&rfr_iscdi=true