Loading…

Integration of multi-omics data for survival prediction of lung adenocarcinoma

•SBMOI could integrate multi-omics data to construct mutation gene vectors.•SBMOI + RSF successfully predict long and short term survival of LUAD patients.•Compared with SCN and PPI, the FCN based SBMOI+ RSF model had better performance. The morbidity of lung adenocarcinoma (LUAD) has been increasin...

Full description

Saved in:
Bibliographic Details
Published in:Computer methods and programs in biomedicine 2024-06, Vol.250, p.108192-108192, Article 108192
Main Authors: Guo, Dingjie, Wang, Yixian, Chen, Jing, Liu, Xin
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•SBMOI could integrate multi-omics data to construct mutation gene vectors.•SBMOI + RSF successfully predict long and short term survival of LUAD patients.•Compared with SCN and PPI, the FCN based SBMOI+ RSF model had better performance. The morbidity of lung adenocarcinoma (LUAD) has been increasing year by year and the prognosis is poor. This has prompted researchers to study the survival of LUAD patients to ensure that patients can be cured in time or survive after appropriate treatment. There is still no fully valid model that can be applied to clinical practice. We introduced struc2vec-based multi-omics data integration (SBMOI), which could integrate gene expression, somatic mutations and clinical data to construct mutation gene vectors representing LUAD patient features. Based on the patient features, the random survival forest (RSF) model was used to predict the long- and short-term survival of LUAD patients. To further demonstrate the superiority of SBMOI, we simultaneously replaced scale-free gene co-expression network (FCN) with a protein-protein interaction (PPI) network and a significant co-expression network (SCN) to compare accuracy in predicting LUAD patient survival under the same conditions. Our results suggested that compared with SCN and PPI network, the FCN based SBMOI combined with RSF model had better performance in long- and short-term survival prediction tasks for LUAD patients. The AUC of 1-year, 5-year, and 10-year survival in the validation dataset were 0.791, 0.825, and 0.917, respectively. This study provided a powerful network-based method to multi-omics data integration. SBMOI combined with RSF successfully predicted long- and short-term survival of LUAD patients, especially with high accuracy on long-term survival. Besides, SBMOI algorithm has the potential to combine with other machine learning models to complete clustering or stratificational tasks, and being applied to other diseases.
ISSN:0169-2607
1872-7565
DOI:10.1016/j.cmpb.2024.108192