Loading…

Racial Differences in Accuracy of Predictive Models for High-Flow Nasal Cannula Failure in COVID-19

To develop and validate machine learning (ML) models to predict high-flow nasal cannula (HFNC) failure in COVID-19, compare their performance to the respiratory rate-oxygenation (ROX) index, and evaluate model accuracy by self-reported race. Retrospective cohort study. Four Emory University Hospital...

Full description

Saved in:
Bibliographic Details
Published in:Critical care explorations 2024-03, Vol.6 (3), p.e1059
Main Authors: Yang, Philip, Gregory, Ismail A, Robichaux, Chad, Holder, Andre L, Martin, Greg S, Esper, Annette M, Kamaleswaran, Rishikesan, Gichoya, Judy W, Bhavani, Sivasubramanium V
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c424t-37ac932e5446ff2150a4f8edab9d8032d8933ab35e95f1deba0e1aa0dcd7632b3
container_end_page
container_issue 3
container_start_page e1059
container_title Critical care explorations
container_volume 6
creator Yang, Philip
Gregory, Ismail A
Robichaux, Chad
Holder, Andre L
Martin, Greg S
Esper, Annette M
Kamaleswaran, Rishikesan
Gichoya, Judy W
Bhavani, Sivasubramanium V
description To develop and validate machine learning (ML) models to predict high-flow nasal cannula (HFNC) failure in COVID-19, compare their performance to the respiratory rate-oxygenation (ROX) index, and evaluate model accuracy by self-reported race. Retrospective cohort study. Four Emory University Hospitals in Atlanta, GA. Adult patients hospitalized with COVID-19 between March 2020 and April 2022 who received HFNC therapy within 24 hours of ICU admission were included. None. Four types of supervised ML models were developed for predicting HFNC failure (defined as intubation or death within 7 d of HFNC initiation), using routine clinical variables from the first 24 hours of ICU admission. Models were trained on the first 60% ( = 594) of admissions and validated on the latter 40% ( = 390) of admissions to simulate prospective implementation. Among 984 patients included, 317 patients (32.2%) developed HFNC failure. eXtreme Gradient Boosting (XGB) model had the highest area under the receiver-operator characteristic curve (AUROC) for predicting HFNC failure (0.707), and was the only model with significantly better performance than the ROX index (AUROC 0.616). XGB model had significantly worse performance in Black patients compared with White patients (AUROC 0.663 vs. 0.808, = 0.02). Racial differences in the XGB model were reduced and no longer statistically significant when restricted to patients with nonmissing arterial blood gas data, and when XGB model was developed to predict mortality (rather than the composite outcome of failure, which could be influenced by biased clinical decisions for intubation). Our XGB model had better discrimination for predicting HFNC failure in COVID-19 than the ROX index, but had racial differences in accuracy of predictions. Further studies are needed to understand and mitigate potential sources of biases in clinical ML models and to improve their equitability.
doi_str_mv 10.1097/CCE.0000000000001059
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_aa8b5228a84e4813b909d5b356f7c8ec</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_aa8b5228a84e4813b909d5b356f7c8ec</doaj_id><sourcerecordid>3076767113</sourcerecordid><originalsourceid>FETCH-LOGICAL-c424t-37ac932e5446ff2150a4f8edab9d8032d8933ab35e95f1deba0e1aa0dcd7632b3</originalsourceid><addsrcrecordid>eNpdkU9v1DAQxS0EotXSb4CQj1xS_C-JfUJV2qUrFYoQcLUm9njrKhsXe1PUb0-WLdUW--DR-M1vnvQIecvZKWem_dB1F6fs4HBWmxfkWDTSVJoJ_fKgPiInpdzOIsFrXrfqNTmS2rR13bTHxH0DF2Gg5zEEzDg6LDSO9My5KYN7oCnQrxl9dNt4j_Rz8jgUGlKml3F9Uy2H9Jt-gTIDOhjHaQC6hDhMGXeQ7vrn6rzi5g15FWAoePL4LsiP5cX37rK6uv606s6uKqeE2layBWekwFqpJoTZLAMVNHrojddMCq-NlNDLGk0duMceGHIA5p1vGyl6uSCrPdcnuLV3OW4gP9gE0f5tpLy2kLfRDWgBdF8LoUErVJrL3jDj65ndhNZpdDPr4551N_Ub9A7HbYbhGfT5zxhv7DrdW86FUDunC_L-kZDTrwnL1m5icTgMMGKaipWsbebL-U6q9lKXUykZw9Mezuwubzvnbf_Pex57d-jxaehfuvIPXfOlGQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3076767113</pqid></control><display><type>article</type><title>Racial Differences in Accuracy of Predictive Models for High-Flow Nasal Cannula Failure in COVID-19</title><source>PubMed (Medline)</source><source>HEAL-Link subscriptions: Lippincott Williams &amp; Wilkins</source><creator>Yang, Philip ; Gregory, Ismail A ; Robichaux, Chad ; Holder, Andre L ; Martin, Greg S ; Esper, Annette M ; Kamaleswaran, Rishikesan ; Gichoya, Judy W ; Bhavani, Sivasubramanium V</creator><creatorcontrib>Yang, Philip ; Gregory, Ismail A ; Robichaux, Chad ; Holder, Andre L ; Martin, Greg S ; Esper, Annette M ; Kamaleswaran, Rishikesan ; Gichoya, Judy W ; Bhavani, Sivasubramanium V</creatorcontrib><description>To develop and validate machine learning (ML) models to predict high-flow nasal cannula (HFNC) failure in COVID-19, compare their performance to the respiratory rate-oxygenation (ROX) index, and evaluate model accuracy by self-reported race. Retrospective cohort study. Four Emory University Hospitals in Atlanta, GA. Adult patients hospitalized with COVID-19 between March 2020 and April 2022 who received HFNC therapy within 24 hours of ICU admission were included. None. Four types of supervised ML models were developed for predicting HFNC failure (defined as intubation or death within 7 d of HFNC initiation), using routine clinical variables from the first 24 hours of ICU admission. Models were trained on the first 60% ( = 594) of admissions and validated on the latter 40% ( = 390) of admissions to simulate prospective implementation. Among 984 patients included, 317 patients (32.2%) developed HFNC failure. eXtreme Gradient Boosting (XGB) model had the highest area under the receiver-operator characteristic curve (AUROC) for predicting HFNC failure (0.707), and was the only model with significantly better performance than the ROX index (AUROC 0.616). XGB model had significantly worse performance in Black patients compared with White patients (AUROC 0.663 vs. 0.808, = 0.02). Racial differences in the XGB model were reduced and no longer statistically significant when restricted to patients with nonmissing arterial blood gas data, and when XGB model was developed to predict mortality (rather than the composite outcome of failure, which could be influenced by biased clinical decisions for intubation). Our XGB model had better discrimination for predicting HFNC failure in COVID-19 than the ROX index, but had racial differences in accuracy of predictions. Further studies are needed to understand and mitigate potential sources of biases in clinical ML models and to improve their equitability.</description><identifier>ISSN: 2639-8028</identifier><identifier>EISSN: 2639-8028</identifier><identifier>DOI: 10.1097/CCE.0000000000001059</identifier><identifier>PMID: 38975567</identifier><language>eng</language><publisher>United States: Lippincott Williams &amp; Wilkins</publisher><subject>Aged ; Cannula ; COVID-19 - ethnology ; COVID-19 - therapy ; Female ; Humans ; Intensive Care Units ; Machine Learning ; Male ; Middle Aged ; Noninvasive Ventilation - methods ; Original Clinical Report ; Oxygen Inhalation Therapy - methods ; Retrospective Studies ; SARS-CoV-2 ; Treatment Failure</subject><ispartof>Critical care explorations, 2024-03, Vol.6 (3), p.e1059</ispartof><rights>Copyright © 2024 The Authors. Published by Wolters Kluwer Health, Inc. on behalf of the Society of Critical Care Medicine.</rights><rights>Copyright © 2024 The Authors. Published by Wolters Kluwer Health, Inc. on behalf of the Society of Critical Care Medicine. 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c424t-37ac932e5446ff2150a4f8edab9d8032d8933ab35e95f1deba0e1aa0dcd7632b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11224893/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11224893/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38975567$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Yang, Philip</creatorcontrib><creatorcontrib>Gregory, Ismail A</creatorcontrib><creatorcontrib>Robichaux, Chad</creatorcontrib><creatorcontrib>Holder, Andre L</creatorcontrib><creatorcontrib>Martin, Greg S</creatorcontrib><creatorcontrib>Esper, Annette M</creatorcontrib><creatorcontrib>Kamaleswaran, Rishikesan</creatorcontrib><creatorcontrib>Gichoya, Judy W</creatorcontrib><creatorcontrib>Bhavani, Sivasubramanium V</creatorcontrib><title>Racial Differences in Accuracy of Predictive Models for High-Flow Nasal Cannula Failure in COVID-19</title><title>Critical care explorations</title><addtitle>Crit Care Explor</addtitle><description>To develop and validate machine learning (ML) models to predict high-flow nasal cannula (HFNC) failure in COVID-19, compare their performance to the respiratory rate-oxygenation (ROX) index, and evaluate model accuracy by self-reported race. Retrospective cohort study. Four Emory University Hospitals in Atlanta, GA. Adult patients hospitalized with COVID-19 between March 2020 and April 2022 who received HFNC therapy within 24 hours of ICU admission were included. None. Four types of supervised ML models were developed for predicting HFNC failure (defined as intubation or death within 7 d of HFNC initiation), using routine clinical variables from the first 24 hours of ICU admission. Models were trained on the first 60% ( = 594) of admissions and validated on the latter 40% ( = 390) of admissions to simulate prospective implementation. Among 984 patients included, 317 patients (32.2%) developed HFNC failure. eXtreme Gradient Boosting (XGB) model had the highest area under the receiver-operator characteristic curve (AUROC) for predicting HFNC failure (0.707), and was the only model with significantly better performance than the ROX index (AUROC 0.616). XGB model had significantly worse performance in Black patients compared with White patients (AUROC 0.663 vs. 0.808, = 0.02). Racial differences in the XGB model were reduced and no longer statistically significant when restricted to patients with nonmissing arterial blood gas data, and when XGB model was developed to predict mortality (rather than the composite outcome of failure, which could be influenced by biased clinical decisions for intubation). Our XGB model had better discrimination for predicting HFNC failure in COVID-19 than the ROX index, but had racial differences in accuracy of predictions. Further studies are needed to understand and mitigate potential sources of biases in clinical ML models and to improve their equitability.</description><subject>Aged</subject><subject>Cannula</subject><subject>COVID-19 - ethnology</subject><subject>COVID-19 - therapy</subject><subject>Female</subject><subject>Humans</subject><subject>Intensive Care Units</subject><subject>Machine Learning</subject><subject>Male</subject><subject>Middle Aged</subject><subject>Noninvasive Ventilation - methods</subject><subject>Original Clinical Report</subject><subject>Oxygen Inhalation Therapy - methods</subject><subject>Retrospective Studies</subject><subject>SARS-CoV-2</subject><subject>Treatment Failure</subject><issn>2639-8028</issn><issn>2639-8028</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNpdkU9v1DAQxS0EotXSb4CQj1xS_C-JfUJV2qUrFYoQcLUm9njrKhsXe1PUb0-WLdUW--DR-M1vnvQIecvZKWem_dB1F6fs4HBWmxfkWDTSVJoJ_fKgPiInpdzOIsFrXrfqNTmS2rR13bTHxH0DF2Gg5zEEzDg6LDSO9My5KYN7oCnQrxl9dNt4j_Rz8jgUGlKml3F9Uy2H9Jt-gTIDOhjHaQC6hDhMGXeQ7vrn6rzi5g15FWAoePL4LsiP5cX37rK6uv606s6uKqeE2layBWekwFqpJoTZLAMVNHrojddMCq-NlNDLGk0duMceGHIA5p1vGyl6uSCrPdcnuLV3OW4gP9gE0f5tpLy2kLfRDWgBdF8LoUErVJrL3jDj65ndhNZpdDPr4551N_Ub9A7HbYbhGfT5zxhv7DrdW86FUDunC_L-kZDTrwnL1m5icTgMMGKaipWsbebL-U6q9lKXUykZw9Mezuwubzvnbf_Pex57d-jxaehfuvIPXfOlGQ</recordid><startdate>20240301</startdate><enddate>20240301</enddate><creator>Yang, Philip</creator><creator>Gregory, Ismail A</creator><creator>Robichaux, Chad</creator><creator>Holder, Andre L</creator><creator>Martin, Greg S</creator><creator>Esper, Annette M</creator><creator>Kamaleswaran, Rishikesan</creator><creator>Gichoya, Judy W</creator><creator>Bhavani, Sivasubramanium V</creator><general>Lippincott Williams &amp; Wilkins</general><general>Wolters Kluwer</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20240301</creationdate><title>Racial Differences in Accuracy of Predictive Models for High-Flow Nasal Cannula Failure in COVID-19</title><author>Yang, Philip ; Gregory, Ismail A ; Robichaux, Chad ; Holder, Andre L ; Martin, Greg S ; Esper, Annette M ; Kamaleswaran, Rishikesan ; Gichoya, Judy W ; Bhavani, Sivasubramanium V</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c424t-37ac932e5446ff2150a4f8edab9d8032d8933ab35e95f1deba0e1aa0dcd7632b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Aged</topic><topic>Cannula</topic><topic>COVID-19 - ethnology</topic><topic>COVID-19 - therapy</topic><topic>Female</topic><topic>Humans</topic><topic>Intensive Care Units</topic><topic>Machine Learning</topic><topic>Male</topic><topic>Middle Aged</topic><topic>Noninvasive Ventilation - methods</topic><topic>Original Clinical Report</topic><topic>Oxygen Inhalation Therapy - methods</topic><topic>Retrospective Studies</topic><topic>SARS-CoV-2</topic><topic>Treatment Failure</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yang, Philip</creatorcontrib><creatorcontrib>Gregory, Ismail A</creatorcontrib><creatorcontrib>Robichaux, Chad</creatorcontrib><creatorcontrib>Holder, Andre L</creatorcontrib><creatorcontrib>Martin, Greg S</creatorcontrib><creatorcontrib>Esper, Annette M</creatorcontrib><creatorcontrib>Kamaleswaran, Rishikesan</creatorcontrib><creatorcontrib>Gichoya, Judy W</creatorcontrib><creatorcontrib>Bhavani, Sivasubramanium V</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Critical care explorations</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yang, Philip</au><au>Gregory, Ismail A</au><au>Robichaux, Chad</au><au>Holder, Andre L</au><au>Martin, Greg S</au><au>Esper, Annette M</au><au>Kamaleswaran, Rishikesan</au><au>Gichoya, Judy W</au><au>Bhavani, Sivasubramanium V</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Racial Differences in Accuracy of Predictive Models for High-Flow Nasal Cannula Failure in COVID-19</atitle><jtitle>Critical care explorations</jtitle><addtitle>Crit Care Explor</addtitle><date>2024-03-01</date><risdate>2024</risdate><volume>6</volume><issue>3</issue><spage>e1059</spage><pages>e1059-</pages><issn>2639-8028</issn><eissn>2639-8028</eissn><abstract>To develop and validate machine learning (ML) models to predict high-flow nasal cannula (HFNC) failure in COVID-19, compare their performance to the respiratory rate-oxygenation (ROX) index, and evaluate model accuracy by self-reported race. Retrospective cohort study. Four Emory University Hospitals in Atlanta, GA. Adult patients hospitalized with COVID-19 between March 2020 and April 2022 who received HFNC therapy within 24 hours of ICU admission were included. None. Four types of supervised ML models were developed for predicting HFNC failure (defined as intubation or death within 7 d of HFNC initiation), using routine clinical variables from the first 24 hours of ICU admission. Models were trained on the first 60% ( = 594) of admissions and validated on the latter 40% ( = 390) of admissions to simulate prospective implementation. Among 984 patients included, 317 patients (32.2%) developed HFNC failure. eXtreme Gradient Boosting (XGB) model had the highest area under the receiver-operator characteristic curve (AUROC) for predicting HFNC failure (0.707), and was the only model with significantly better performance than the ROX index (AUROC 0.616). XGB model had significantly worse performance in Black patients compared with White patients (AUROC 0.663 vs. 0.808, = 0.02). Racial differences in the XGB model were reduced and no longer statistically significant when restricted to patients with nonmissing arterial blood gas data, and when XGB model was developed to predict mortality (rather than the composite outcome of failure, which could be influenced by biased clinical decisions for intubation). Our XGB model had better discrimination for predicting HFNC failure in COVID-19 than the ROX index, but had racial differences in accuracy of predictions. Further studies are needed to understand and mitigate potential sources of biases in clinical ML models and to improve their equitability.</abstract><cop>United States</cop><pub>Lippincott Williams &amp; Wilkins</pub><pmid>38975567</pmid><doi>10.1097/CCE.0000000000001059</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2639-8028
ispartof Critical care explorations, 2024-03, Vol.6 (3), p.e1059
issn 2639-8028
2639-8028
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_aa8b5228a84e4813b909d5b356f7c8ec
source PubMed (Medline); HEAL-Link subscriptions: Lippincott Williams & Wilkins
subjects Aged
Cannula
COVID-19 - ethnology
COVID-19 - therapy
Female
Humans
Intensive Care Units
Machine Learning
Male
Middle Aged
Noninvasive Ventilation - methods
Original Clinical Report
Oxygen Inhalation Therapy - methods
Retrospective Studies
SARS-CoV-2
Treatment Failure
title Racial Differences in Accuracy of Predictive Models for High-Flow Nasal Cannula Failure in COVID-19
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T00%3A06%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Racial%20Differences%20in%20Accuracy%20of%20Predictive%20Models%20for%20High-Flow%20Nasal%20Cannula%20Failure%20in%20COVID-19&rft.jtitle=Critical%20care%20explorations&rft.au=Yang,%20Philip&rft.date=2024-03-01&rft.volume=6&rft.issue=3&rft.spage=e1059&rft.pages=e1059-&rft.issn=2639-8028&rft.eissn=2639-8028&rft_id=info:doi/10.1097/CCE.0000000000001059&rft_dat=%3Cproquest_doaj_%3E3076767113%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c424t-37ac932e5446ff2150a4f8edab9d8032d8933ab35e95f1deba0e1aa0dcd7632b3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3076767113&rft_id=info:pmid/38975567&rfr_iscdi=true