Loading…

Machine Learning Incorporating Host Factors for Predicting Survival in Head and Neck Squamous Cell Carcinoma Patients

Prognostication for cancer patients is integral for patient counseling and treatment planning, yet providing accurate prediction can be challenging using existing patient-specific clinical indicators and host factors. In this work, we evaluated common machine learning models in predicting head and n...

Full description

Saved in:
Bibliographic Details
Published in:Cancers 2021-09, Vol.13 (18), p.4559
Main Authors: Yu, Han, Ma, Sung Jun, Farrugia, Mark, Iovoli, Austin J., Wooten, Kimberly E., Gupta, Vishal, McSpadden, Ryan P., Kuriakose, Moni A., Markiewicz, Michael R., Chan, Jon M., Hicks, Wesley L., Platek, Mary E., Singh, Anurag K.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Prognostication for cancer patients is integral for patient counseling and treatment planning, yet providing accurate prediction can be challenging using existing patient-specific clinical indicators and host factors. In this work, we evaluated common machine learning models in predicting head and neck squamous cell carcinoma (HNSCC) patients’ overall survival based on demographic, clinical features and host factors. We found random survival forest had best performance among the models evaluated, which achieved a C-index of 0.729 and AUROC of 0.792 in predicting two-year overall survival. In addition, we verified that host factors are independently predictive of HNSCC overall survival, which improved the C-index by a margin of 0.026 and the AUROC by 0.034. Due to the strong correlation among host factors, we showed that proper dimension reduction is an important step before their incorporation into the machine learning models, which provides a host factor score reflecting the patients’ nutrition and inflammation status. The score by itself showed excellent discriminating capacity with the high-risk group having a hazard ratio of 3.76 (1.93–7.32, p < 0.0001) over the low-risk group. The hazard ratios were further improved to 7.41 (3.66–14.98, p < 0.0001) by the random survival forest model after including demographic and clinical features.
ISSN:2072-6694
2072-6694
DOI:10.3390/cancers13184559