Loading…

Predicting Length of Stay of Coronary Artery Bypass Grafting Patients Using Machine Learning

•Machine learning can help clinicians prioritize the most important information.•Most of the predictive value can be captured in only a few variables.•Key variables included age, last creatinine, and intraoperative transfusions. There is a growing need to identify which bits of information are most...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of surgical research 2021-08, Vol.264, p.68-75
Main Authors: Triana, Austin J., Vyas, Rushikesh, Shah, Ashish S., Tiwari, Vikram
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Machine learning can help clinicians prioritize the most important information.•Most of the predictive value can be captured in only a few variables.•Key variables included age, last creatinine, and intraoperative transfusions. There is a growing need to identify which bits of information are most valuable for healthcare providers. The aim of this study was to search for the highest impact variables in predicting postsurgery length of stay (LOS) for patients who undergo coronary artery bypass grafting (CABG). Using a single institution's Society of Thoracic Surgeons (STS) Registry data, 2121 patients with elective or urgent, isolated CABG were analyzed across 116 variables. Two machine learning techniques of random forest and artificial neural networks (ANNs) were used to search for the highest impact variables in predicting LOS, and results were compared against multiple linear regression. Out-of-sample validation of the models was performed on 105 patients. Of the 10 highest impact variables identified in predicting LOS, four of the most impactful variables were duration intubated, last preoperative creatinine, age, and number of intraoperative packed red blood cell transfusions. The best performing model was an ANN using the ten highest impact variables (testing sample mean absolute error (MAE) = 1.685 d, R2 = 0.232), which performed consistently in the out-of-sample validation (MAE = 1.612 d, R2 = 0.150). Using machine learning, this study identified several novel predictors of postsurgery LOS and reinforced certain known risk factors. Out of the entire STS database, only a few variables carry most of the predictive value for LOS in this population. With this knowledge, a simpler linear regression model has been shared and could be used elsewhere after further validation.
ISSN:0022-4804
1095-8673
DOI:10.1016/j.jss.2021.02.003