Loading…

Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm

The Skeletal Oncology Research Group machine-learning algorithm (SORG-MLA) was developed to predict the survival of patients with spinal metastasis. The algorithm was successfully tested in five international institutions using 1101 patients from different continents. The incorporation of 18 prognos...

Full description

Saved in:
Bibliographic Details
Published in:Clinical orthopaedics and related research 2024-01, Vol.482 (1), p.143-157
Main Authors: Huang, Chi-Ching, Peng, Kuang-Ping, Hsieh, Hsiang-Chieh, Groot, Olivier Q., Yen, Hung-Kuan, Tsai, Cheng-Chen, Karhade, Aditya V., Lin, Yen-Po, Kao, Yin-Tien, Yang, Jiun-Jen, Dai, Shih-Hsiang, Huang, Chuan-Ching, Chen, Chih-Wei, Yen, Mao-Hsu, Xiao, Fu-Ren, Lin, Wei-Hsin, Verlaan, Jorrit-Jan, Schwab, Joseph H., Hsu, Feng-Ming, Wong, Tzehong, Yang, Rong-Sen, Yang, Shu-Hua, Hu, Ming-Hsiao
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c3867-6ef70a8dcafb56793f7228b386ceceb92e3bf4e45404b2b87e4058a7488696473
container_end_page 157
container_issue 1
container_start_page 143
container_title Clinical orthopaedics and related research
container_volume 482
creator Huang, Chi-Ching
Peng, Kuang-Ping
Hsieh, Hsiang-Chieh
Groot, Olivier Q.
Yen, Hung-Kuan
Tsai, Cheng-Chen
Karhade, Aditya V.
Lin, Yen-Po
Kao, Yin-Tien
Yang, Jiun-Jen
Dai, Shih-Hsiang
Huang, Chuan-Ching
Chen, Chih-Wei
Yen, Mao-Hsu
Xiao, Fu-Ren
Lin, Wei-Hsin
Verlaan, Jorrit-Jan
Schwab, Joseph H.
Hsu, Feng-Ming
Wong, Tzehong
Yang, Rong-Sen
Yang, Shu-Hua
Hu, Ming-Hsiao
description The Skeletal Oncology Research Group machine-learning algorithm (SORG-MLA) was developed to predict the survival of patients with spinal metastasis. The algorithm was successfully tested in five international institutions using 1101 patients from different continents. The incorporation of 18 prognostic factors strengthens its predictive ability but limits its clinical utility because some prognostic factors might not be clinically available when a clinician wishes to make a prediction. We performed this study to (1) evaluate the SORG-MLA's performance with data and (2) develop an internet-based application to impute the missing data. A total of 2768 patients were included in this study. The data of 617 patients who were treated surgically were intentionally erased, and the data of the other 2151 patients who were treated with radiotherapy and medical treatment were used to impute the artificially missing data. Compared with those who were treated nonsurgically, patients undergoing surgery were younger (median 59 years [IQR 51 to 67 years] versus median 62 years [IQR 53 to 71 years]) and had a higher proportion of patients with at least three spinal metastatic levels (77% [474 of 617] versus 72% [1547 of 2151]), more neurologic deficit (normal American Spinal Injury Association [E] 68% [301 of 443] versus 79% [1227 of 1561]), higher BMI (23 kg/m 2 [IQR 20 to 25 kg/m 2 ] versus 22 kg/m 2 [IQR 20 to 25 kg/m 2 ]), higher platelet count (240 × 10 3 /µL [IQR 173 to 327 × 10 3 /µL] versus 227 × 10 3 /µL [IQR 165 to 302 × 10 3 /µL], higher lymphocyte count (15 × 10 3 /µL [IQR 9 to 21× 10 3 /µL] versus 14 × 10 3 /µL [IQR 8 to 21 × 10 3 /µL]), lower serum creatinine level (0.7 mg/dL [IQR 0.6 to 0.9 mg/dL] versus 0.8 mg/dL [IQR 0.6 to 1.0 mg/dL]), less previous systemic therapy (19% [115 of 617] versus 24% [526 of 2151]), fewer Charlson comorbidities other than cancer (28% [170 of 617] versus 36% [770 of 2151]), and longer median survival. The two patient groups did not differ in other regards. These findings aligned with our institutional philosophy of selecting patients for surgical intervention based on their level of favorable prognostic factors such as BMI or lymphocyte counts and lower levels of unfavorable prognostic factors such as white blood cell counts or serum creatinine level, as well as the degree of spinal instability and severity of neurologic deficits. This approach aims to identify patients with better survival outcomes and prioritize their surgical
doi_str_mv 10.1097/CORR.0000000000002706
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10723864</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2904232175</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3867-6ef70a8dcafb56793f7228b386ceceb92e3bf4e45404b2b87e4058a7488696473</originalsourceid><addsrcrecordid>eNpdkt9u0zAUxiMEYmXwCCBL3HCT4T9J7FxNVQtj0qpOHQjuLCc7aTwcO9juJh6M98OhZYxZliyf8zufj4--LHtN8AnBNX-_WG82J_jBohxXT7IZKanICWH0aTZL0TqvKfl2lL0I4SZdWVHS59kR4wxXFa1n2a-lg4BiD-jSQwDbAnIdWukQtN2ipYoKzbsO2rhnwHfOD-qATaGr9eYMrVTbawu5AeXtVDg3W-d17AeUeHSpogYbA_qaQuhq1FYZtIKoQto6nKIl3IJx45CgSVdZdG4jeAsRzcfR6DYJOPtP9WX2rFMmwKvDeZx9-fjh8-JTfrE-O1_ML_KWiYrnFXQcK3Hdqq4pK16zjlMqmpRroYWmpsCaroCiLHDR0EZwKHApFC-EqOqq4Ow4O93rjrtmgOs29eeVkaPXg_I_pVNa_p-xupdbdysJ5jQ9UySFdwcF737sIEQ56NCCMcqC2wVJBS1JyQTHCX37CL1xO59GlagaF5RRwstElXuq9S4ED919NwTLyRlycoZ87IxU9-bhV-6r_lohAcUeuHMmzT58N7s78LIHZWL_R49hUeV1fVDNMaGUs9_WecZg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2904232175</pqid></control><display><type>article</type><title>Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm</title><source>PubMed Central</source><creator>Huang, Chi-Ching ; Peng, Kuang-Ping ; Hsieh, Hsiang-Chieh ; Groot, Olivier Q. ; Yen, Hung-Kuan ; Tsai, Cheng-Chen ; Karhade, Aditya V. ; Lin, Yen-Po ; Kao, Yin-Tien ; Yang, Jiun-Jen ; Dai, Shih-Hsiang ; Huang, Chuan-Ching ; Chen, Chih-Wei ; Yen, Mao-Hsu ; Xiao, Fu-Ren ; Lin, Wei-Hsin ; Verlaan, Jorrit-Jan ; Schwab, Joseph H. ; Hsu, Feng-Ming ; Wong, Tzehong ; Yang, Rong-Sen ; Yang, Shu-Hua ; Hu, Ming-Hsiao</creator><creatorcontrib>Huang, Chi-Ching ; Peng, Kuang-Ping ; Hsieh, Hsiang-Chieh ; Groot, Olivier Q. ; Yen, Hung-Kuan ; Tsai, Cheng-Chen ; Karhade, Aditya V. ; Lin, Yen-Po ; Kao, Yin-Tien ; Yang, Jiun-Jen ; Dai, Shih-Hsiang ; Huang, Chuan-Ching ; Chen, Chih-Wei ; Yen, Mao-Hsu ; Xiao, Fu-Ren ; Lin, Wei-Hsin ; Verlaan, Jorrit-Jan ; Schwab, Joseph H. ; Hsu, Feng-Ming ; Wong, Tzehong ; Yang, Rong-Sen ; Yang, Shu-Hua ; Hu, Ming-Hsiao</creatorcontrib><description>The Skeletal Oncology Research Group machine-learning algorithm (SORG-MLA) was developed to predict the survival of patients with spinal metastasis. The algorithm was successfully tested in five international institutions using 1101 patients from different continents. The incorporation of 18 prognostic factors strengthens its predictive ability but limits its clinical utility because some prognostic factors might not be clinically available when a clinician wishes to make a prediction. We performed this study to (1) evaluate the SORG-MLA's performance with data and (2) develop an internet-based application to impute the missing data. A total of 2768 patients were included in this study. The data of 617 patients who were treated surgically were intentionally erased, and the data of the other 2151 patients who were treated with radiotherapy and medical treatment were used to impute the artificially missing data. Compared with those who were treated nonsurgically, patients undergoing surgery were younger (median 59 years [IQR 51 to 67 years] versus median 62 years [IQR 53 to 71 years]) and had a higher proportion of patients with at least three spinal metastatic levels (77% [474 of 617] versus 72% [1547 of 2151]), more neurologic deficit (normal American Spinal Injury Association [E] 68% [301 of 443] versus 79% [1227 of 1561]), higher BMI (23 kg/m 2 [IQR 20 to 25 kg/m 2 ] versus 22 kg/m 2 [IQR 20 to 25 kg/m 2 ]), higher platelet count (240 × 10 3 /µL [IQR 173 to 327 × 10 3 /µL] versus 227 × 10 3 /µL [IQR 165 to 302 × 10 3 /µL], higher lymphocyte count (15 × 10 3 /µL [IQR 9 to 21× 10 3 /µL] versus 14 × 10 3 /µL [IQR 8 to 21 × 10 3 /µL]), lower serum creatinine level (0.7 mg/dL [IQR 0.6 to 0.9 mg/dL] versus 0.8 mg/dL [IQR 0.6 to 1.0 mg/dL]), less previous systemic therapy (19% [115 of 617] versus 24% [526 of 2151]), fewer Charlson comorbidities other than cancer (28% [170 of 617] versus 36% [770 of 2151]), and longer median survival. The two patient groups did not differ in other regards. These findings aligned with our institutional philosophy of selecting patients for surgical intervention based on their level of favorable prognostic factors such as BMI or lymphocyte counts and lower levels of unfavorable prognostic factors such as white blood cell counts or serum creatinine level, as well as the degree of spinal instability and severity of neurologic deficits. This approach aims to identify patients with better survival outcomes and prioritize their surgical intervention accordingly. Seven factors (serum albumin and alkaline phosphatase levels, international normalized ratio, lymphocyte and neutrophil counts, and the presence of visceral or brain metastases) were considered possible missing items based on five previous validation studies and clinical experience. Artificially missing data were imputed using the missForest imputation technique, which was previously applied and successfully tested to fit the SORG-MLA in validation studies. Discrimination, calibration, overall performance, and decision curve analysis were applied to evaluate the SORG-MLA's performance. The discrimination ability was measured with an area under the receiver operating characteristic curve. It ranges from 0.5 to 1.0, with 0.5 indicating the worst discrimination and 1.0 indicating perfect discrimination. An area under the curve of 0.7 is considered clinically acceptable discrimination. Calibration refers to the agreement between the predicted outcomes and actual outcomes. An ideal calibration model will yield predicted survival rates that are congruent with the observed survival rates. The Brier score measures the squared difference between the actual outcome and predicted probability, which captures calibration and discrimination ability simultaneously. A Brier score of 0 indicates perfect prediction, whereas a Brier score of 1 indicates the poorest prediction. A decision curve analysis was performed for the 6-week, 90-day, and 1-year prediction models to evaluate their net benefit across different threshold probabilities. Using the results from our analysis, we developed an internet-based application that facilitates real-time data imputation for clinical decision-making at the point of care. This tool allows healthcare professionals to efficiently and effectively address missing data, ensuring that patient care remains optimal at all times. Generally, the SORG-MLA demonstrated good discriminatory ability, with areas under the curve greater than 0.7 in most cases, and good overall performance, with up to 25% improvement in Brier scores in the presence of one to three missing items. The only exceptions were albumin level and lymphocyte count, because the SORG-MLA's performance was reduced when these two items were missing, indicating that the SORG-MLA might be unreliable without these values. The model tended to underestimate the patient survival rate. As the number of missing items increased, the model's discriminatory ability was progressively impaired, and a marked underestimation of patient survival rates was observed. Specifically, when three items were missing, the number of actual survivors was up to 1.3 times greater than the number of expected survivors, while only 10% discrepancy was observed when only one item was missing. When either two or three items were omitted, the decision curves exhibited substantial overlap, indicating a lack of consistent disparities in performance. This finding suggests that the SORG-MLA consistently generates accurate predictions, regardless of the two or three items that are omitted. We developed an internet application ( https://sorg-spine-mets-missing-data-imputation.azurewebsites.net/ ) that allows the use of SORG-MLA with up to three missing items. The SORG-MLA generally performed well in the presence of one to three missing items, except for serum albumin level and lymphocyte count (which are essential for adequate predictions, even using our modified version of the SORG-MLA). We recommend that future studies should develop prediction models that allow for their use when there are missing data, or provide a means to impute those missing data, because some data are not available at the time a clinical decision must be made. The results suggested the algorithm could be helpful when a radiologic evaluation owing to a lengthy waiting period cannot be performed in time, especially in situations when an early operation could be beneficial. It could help orthopaedic surgeons to decide whether to intervene palliatively or extensively, even when the surgical indication is clear.</description><identifier>ISSN: 0009-921X</identifier><identifier>ISSN: 1528-1132</identifier><identifier>EISSN: 1528-1132</identifier><identifier>DOI: 10.1097/CORR.0000000000002706</identifier><identifier>PMID: 37306629</identifier><language>eng</language><publisher>United States: Wolters Kluwer</publisher><subject>Aged ; Albumin ; Algorithms ; Alkaline phosphatase ; Calibration ; Cell number ; Clinical Research ; Comorbidity ; Creatinine ; Decision making ; Decision Support Techniques ; Female ; Humans ; Internet ; Learning algorithms ; Leukocytes (neutrophilic) ; Lymphocytes ; Machine Learning ; Male ; Medical prognosis ; Medical treatment ; Metastases ; Metastasis ; Middle Aged ; Missing data ; Patients ; Prediction models ; Predictive Value of Tests ; Prognosis ; Radiation therapy ; Retrospective Studies ; Risk Factors ; Spinal Neoplasms - mortality ; Spinal Neoplasms - secondary ; Spinal Neoplasms - surgery ; Spinal Neoplasms - therapy ; Validation studies</subject><ispartof>Clinical orthopaedics and related research, 2024-01, Vol.482 (1), p.143-157</ispartof><rights>Wolters Kluwer</rights><rights>Copyright © 2023 by the Association of Bone and Joint Surgeons.</rights><rights>2023 by the Association of Bone and Joint Surgeons</rights><rights>2023 by the Association of Bone and Joint Surgeons 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c3867-6ef70a8dcafb56793f7228b386ceceb92e3bf4e45404b2b87e4058a7488696473</cites><orcidid>0000-0002-3123-3224 ; 0000-0002-5752-7917</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10723864/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10723864/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37306629$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Huang, Chi-Ching</creatorcontrib><creatorcontrib>Peng, Kuang-Ping</creatorcontrib><creatorcontrib>Hsieh, Hsiang-Chieh</creatorcontrib><creatorcontrib>Groot, Olivier Q.</creatorcontrib><creatorcontrib>Yen, Hung-Kuan</creatorcontrib><creatorcontrib>Tsai, Cheng-Chen</creatorcontrib><creatorcontrib>Karhade, Aditya V.</creatorcontrib><creatorcontrib>Lin, Yen-Po</creatorcontrib><creatorcontrib>Kao, Yin-Tien</creatorcontrib><creatorcontrib>Yang, Jiun-Jen</creatorcontrib><creatorcontrib>Dai, Shih-Hsiang</creatorcontrib><creatorcontrib>Huang, Chuan-Ching</creatorcontrib><creatorcontrib>Chen, Chih-Wei</creatorcontrib><creatorcontrib>Yen, Mao-Hsu</creatorcontrib><creatorcontrib>Xiao, Fu-Ren</creatorcontrib><creatorcontrib>Lin, Wei-Hsin</creatorcontrib><creatorcontrib>Verlaan, Jorrit-Jan</creatorcontrib><creatorcontrib>Schwab, Joseph H.</creatorcontrib><creatorcontrib>Hsu, Feng-Ming</creatorcontrib><creatorcontrib>Wong, Tzehong</creatorcontrib><creatorcontrib>Yang, Rong-Sen</creatorcontrib><creatorcontrib>Yang, Shu-Hua</creatorcontrib><creatorcontrib>Hu, Ming-Hsiao</creatorcontrib><title>Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm</title><title>Clinical orthopaedics and related research</title><addtitle>Clin Orthop Relat Res</addtitle><description>The Skeletal Oncology Research Group machine-learning algorithm (SORG-MLA) was developed to predict the survival of patients with spinal metastasis. The algorithm was successfully tested in five international institutions using 1101 patients from different continents. The incorporation of 18 prognostic factors strengthens its predictive ability but limits its clinical utility because some prognostic factors might not be clinically available when a clinician wishes to make a prediction. We performed this study to (1) evaluate the SORG-MLA's performance with data and (2) develop an internet-based application to impute the missing data. A total of 2768 patients were included in this study. The data of 617 patients who were treated surgically were intentionally erased, and the data of the other 2151 patients who were treated with radiotherapy and medical treatment were used to impute the artificially missing data. Compared with those who were treated nonsurgically, patients undergoing surgery were younger (median 59 years [IQR 51 to 67 years] versus median 62 years [IQR 53 to 71 years]) and had a higher proportion of patients with at least three spinal metastatic levels (77% [474 of 617] versus 72% [1547 of 2151]), more neurologic deficit (normal American Spinal Injury Association [E] 68% [301 of 443] versus 79% [1227 of 1561]), higher BMI (23 kg/m 2 [IQR 20 to 25 kg/m 2 ] versus 22 kg/m 2 [IQR 20 to 25 kg/m 2 ]), higher platelet count (240 × 10 3 /µL [IQR 173 to 327 × 10 3 /µL] versus 227 × 10 3 /µL [IQR 165 to 302 × 10 3 /µL], higher lymphocyte count (15 × 10 3 /µL [IQR 9 to 21× 10 3 /µL] versus 14 × 10 3 /µL [IQR 8 to 21 × 10 3 /µL]), lower serum creatinine level (0.7 mg/dL [IQR 0.6 to 0.9 mg/dL] versus 0.8 mg/dL [IQR 0.6 to 1.0 mg/dL]), less previous systemic therapy (19% [115 of 617] versus 24% [526 of 2151]), fewer Charlson comorbidities other than cancer (28% [170 of 617] versus 36% [770 of 2151]), and longer median survival. The two patient groups did not differ in other regards. These findings aligned with our institutional philosophy of selecting patients for surgical intervention based on their level of favorable prognostic factors such as BMI or lymphocyte counts and lower levels of unfavorable prognostic factors such as white blood cell counts or serum creatinine level, as well as the degree of spinal instability and severity of neurologic deficits. This approach aims to identify patients with better survival outcomes and prioritize their surgical intervention accordingly. Seven factors (serum albumin and alkaline phosphatase levels, international normalized ratio, lymphocyte and neutrophil counts, and the presence of visceral or brain metastases) were considered possible missing items based on five previous validation studies and clinical experience. Artificially missing data were imputed using the missForest imputation technique, which was previously applied and successfully tested to fit the SORG-MLA in validation studies. Discrimination, calibration, overall performance, and decision curve analysis were applied to evaluate the SORG-MLA's performance. The discrimination ability was measured with an area under the receiver operating characteristic curve. It ranges from 0.5 to 1.0, with 0.5 indicating the worst discrimination and 1.0 indicating perfect discrimination. An area under the curve of 0.7 is considered clinically acceptable discrimination. Calibration refers to the agreement between the predicted outcomes and actual outcomes. An ideal calibration model will yield predicted survival rates that are congruent with the observed survival rates. The Brier score measures the squared difference between the actual outcome and predicted probability, which captures calibration and discrimination ability simultaneously. A Brier score of 0 indicates perfect prediction, whereas a Brier score of 1 indicates the poorest prediction. A decision curve analysis was performed for the 6-week, 90-day, and 1-year prediction models to evaluate their net benefit across different threshold probabilities. Using the results from our analysis, we developed an internet-based application that facilitates real-time data imputation for clinical decision-making at the point of care. This tool allows healthcare professionals to efficiently and effectively address missing data, ensuring that patient care remains optimal at all times. Generally, the SORG-MLA demonstrated good discriminatory ability, with areas under the curve greater than 0.7 in most cases, and good overall performance, with up to 25% improvement in Brier scores in the presence of one to three missing items. The only exceptions were albumin level and lymphocyte count, because the SORG-MLA's performance was reduced when these two items were missing, indicating that the SORG-MLA might be unreliable without these values. The model tended to underestimate the patient survival rate. As the number of missing items increased, the model's discriminatory ability was progressively impaired, and a marked underestimation of patient survival rates was observed. Specifically, when three items were missing, the number of actual survivors was up to 1.3 times greater than the number of expected survivors, while only 10% discrepancy was observed when only one item was missing. When either two or three items were omitted, the decision curves exhibited substantial overlap, indicating a lack of consistent disparities in performance. This finding suggests that the SORG-MLA consistently generates accurate predictions, regardless of the two or three items that are omitted. We developed an internet application ( https://sorg-spine-mets-missing-data-imputation.azurewebsites.net/ ) that allows the use of SORG-MLA with up to three missing items. The SORG-MLA generally performed well in the presence of one to three missing items, except for serum albumin level and lymphocyte count (which are essential for adequate predictions, even using our modified version of the SORG-MLA). We recommend that future studies should develop prediction models that allow for their use when there are missing data, or provide a means to impute those missing data, because some data are not available at the time a clinical decision must be made. The results suggested the algorithm could be helpful when a radiologic evaluation owing to a lengthy waiting period cannot be performed in time, especially in situations when an early operation could be beneficial. It could help orthopaedic surgeons to decide whether to intervene palliatively or extensively, even when the surgical indication is clear.</description><subject>Aged</subject><subject>Albumin</subject><subject>Algorithms</subject><subject>Alkaline phosphatase</subject><subject>Calibration</subject><subject>Cell number</subject><subject>Clinical Research</subject><subject>Comorbidity</subject><subject>Creatinine</subject><subject>Decision making</subject><subject>Decision Support Techniques</subject><subject>Female</subject><subject>Humans</subject><subject>Internet</subject><subject>Learning algorithms</subject><subject>Leukocytes (neutrophilic)</subject><subject>Lymphocytes</subject><subject>Machine Learning</subject><subject>Male</subject><subject>Medical prognosis</subject><subject>Medical treatment</subject><subject>Metastases</subject><subject>Metastasis</subject><subject>Middle Aged</subject><subject>Missing data</subject><subject>Patients</subject><subject>Prediction models</subject><subject>Predictive Value of Tests</subject><subject>Prognosis</subject><subject>Radiation therapy</subject><subject>Retrospective Studies</subject><subject>Risk Factors</subject><subject>Spinal Neoplasms - mortality</subject><subject>Spinal Neoplasms - secondary</subject><subject>Spinal Neoplasms - surgery</subject><subject>Spinal Neoplasms - therapy</subject><subject>Validation studies</subject><issn>0009-921X</issn><issn>1528-1132</issn><issn>1528-1132</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpdkt9u0zAUxiMEYmXwCCBL3HCT4T9J7FxNVQtj0qpOHQjuLCc7aTwcO9juJh6M98OhZYxZliyf8zufj4--LHtN8AnBNX-_WG82J_jBohxXT7IZKanICWH0aTZL0TqvKfl2lL0I4SZdWVHS59kR4wxXFa1n2a-lg4BiD-jSQwDbAnIdWukQtN2ipYoKzbsO2rhnwHfOD-qATaGr9eYMrVTbawu5AeXtVDg3W-d17AeUeHSpogYbA_qaQuhq1FYZtIKoQto6nKIl3IJx45CgSVdZdG4jeAsRzcfR6DYJOPtP9WX2rFMmwKvDeZx9-fjh8-JTfrE-O1_ML_KWiYrnFXQcK3Hdqq4pK16zjlMqmpRroYWmpsCaroCiLHDR0EZwKHApFC-EqOqq4Ow4O93rjrtmgOs29eeVkaPXg_I_pVNa_p-xupdbdysJ5jQ9UySFdwcF737sIEQ56NCCMcqC2wVJBS1JyQTHCX37CL1xO59GlagaF5RRwstElXuq9S4ED919NwTLyRlycoZ87IxU9-bhV-6r_lohAcUeuHMmzT58N7s78LIHZWL_R49hUeV1fVDNMaGUs9_WecZg</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Huang, Chi-Ching</creator><creator>Peng, Kuang-Ping</creator><creator>Hsieh, Hsiang-Chieh</creator><creator>Groot, Olivier Q.</creator><creator>Yen, Hung-Kuan</creator><creator>Tsai, Cheng-Chen</creator><creator>Karhade, Aditya V.</creator><creator>Lin, Yen-Po</creator><creator>Kao, Yin-Tien</creator><creator>Yang, Jiun-Jen</creator><creator>Dai, Shih-Hsiang</creator><creator>Huang, Chuan-Ching</creator><creator>Chen, Chih-Wei</creator><creator>Yen, Mao-Hsu</creator><creator>Xiao, Fu-Ren</creator><creator>Lin, Wei-Hsin</creator><creator>Verlaan, Jorrit-Jan</creator><creator>Schwab, Joseph H.</creator><creator>Hsu, Feng-Ming</creator><creator>Wong, Tzehong</creator><creator>Yang, Rong-Sen</creator><creator>Yang, Shu-Hua</creator><creator>Hu, Ming-Hsiao</creator><general>Wolters Kluwer</general><general>Lippincott Williams &amp; Wilkins Ovid Technologies</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QP</scope><scope>7T5</scope><scope>H94</scope><scope>K9.</scope><scope>NAPCQ</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-3123-3224</orcidid><orcidid>https://orcid.org/0000-0002-5752-7917</orcidid></search><sort><creationdate>20240101</creationdate><title>Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm</title><author>Huang, Chi-Ching ; Peng, Kuang-Ping ; Hsieh, Hsiang-Chieh ; Groot, Olivier Q. ; Yen, Hung-Kuan ; Tsai, Cheng-Chen ; Karhade, Aditya V. ; Lin, Yen-Po ; Kao, Yin-Tien ; Yang, Jiun-Jen ; Dai, Shih-Hsiang ; Huang, Chuan-Ching ; Chen, Chih-Wei ; Yen, Mao-Hsu ; Xiao, Fu-Ren ; Lin, Wei-Hsin ; Verlaan, Jorrit-Jan ; Schwab, Joseph H. ; Hsu, Feng-Ming ; Wong, Tzehong ; Yang, Rong-Sen ; Yang, Shu-Hua ; Hu, Ming-Hsiao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3867-6ef70a8dcafb56793f7228b386ceceb92e3bf4e45404b2b87e4058a7488696473</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Aged</topic><topic>Albumin</topic><topic>Algorithms</topic><topic>Alkaline phosphatase</topic><topic>Calibration</topic><topic>Cell number</topic><topic>Clinical Research</topic><topic>Comorbidity</topic><topic>Creatinine</topic><topic>Decision making</topic><topic>Decision Support Techniques</topic><topic>Female</topic><topic>Humans</topic><topic>Internet</topic><topic>Learning algorithms</topic><topic>Leukocytes (neutrophilic)</topic><topic>Lymphocytes</topic><topic>Machine Learning</topic><topic>Male</topic><topic>Medical prognosis</topic><topic>Medical treatment</topic><topic>Metastases</topic><topic>Metastasis</topic><topic>Middle Aged</topic><topic>Missing data</topic><topic>Patients</topic><topic>Prediction models</topic><topic>Predictive Value of Tests</topic><topic>Prognosis</topic><topic>Radiation therapy</topic><topic>Retrospective Studies</topic><topic>Risk Factors</topic><topic>Spinal Neoplasms - mortality</topic><topic>Spinal Neoplasms - secondary</topic><topic>Spinal Neoplasms - surgery</topic><topic>Spinal Neoplasms - therapy</topic><topic>Validation studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Chi-Ching</creatorcontrib><creatorcontrib>Peng, Kuang-Ping</creatorcontrib><creatorcontrib>Hsieh, Hsiang-Chieh</creatorcontrib><creatorcontrib>Groot, Olivier Q.</creatorcontrib><creatorcontrib>Yen, Hung-Kuan</creatorcontrib><creatorcontrib>Tsai, Cheng-Chen</creatorcontrib><creatorcontrib>Karhade, Aditya V.</creatorcontrib><creatorcontrib>Lin, Yen-Po</creatorcontrib><creatorcontrib>Kao, Yin-Tien</creatorcontrib><creatorcontrib>Yang, Jiun-Jen</creatorcontrib><creatorcontrib>Dai, Shih-Hsiang</creatorcontrib><creatorcontrib>Huang, Chuan-Ching</creatorcontrib><creatorcontrib>Chen, Chih-Wei</creatorcontrib><creatorcontrib>Yen, Mao-Hsu</creatorcontrib><creatorcontrib>Xiao, Fu-Ren</creatorcontrib><creatorcontrib>Lin, Wei-Hsin</creatorcontrib><creatorcontrib>Verlaan, Jorrit-Jan</creatorcontrib><creatorcontrib>Schwab, Joseph H.</creatorcontrib><creatorcontrib>Hsu, Feng-Ming</creatorcontrib><creatorcontrib>Wong, Tzehong</creatorcontrib><creatorcontrib>Yang, Rong-Sen</creatorcontrib><creatorcontrib>Yang, Shu-Hua</creatorcontrib><creatorcontrib>Hu, Ming-Hsiao</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Immunology Abstracts</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Clinical orthopaedics and related research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Chi-Ching</au><au>Peng, Kuang-Ping</au><au>Hsieh, Hsiang-Chieh</au><au>Groot, Olivier Q.</au><au>Yen, Hung-Kuan</au><au>Tsai, Cheng-Chen</au><au>Karhade, Aditya V.</au><au>Lin, Yen-Po</au><au>Kao, Yin-Tien</au><au>Yang, Jiun-Jen</au><au>Dai, Shih-Hsiang</au><au>Huang, Chuan-Ching</au><au>Chen, Chih-Wei</au><au>Yen, Mao-Hsu</au><au>Xiao, Fu-Ren</au><au>Lin, Wei-Hsin</au><au>Verlaan, Jorrit-Jan</au><au>Schwab, Joseph H.</au><au>Hsu, Feng-Ming</au><au>Wong, Tzehong</au><au>Yang, Rong-Sen</au><au>Yang, Shu-Hua</au><au>Hu, Ming-Hsiao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm</atitle><jtitle>Clinical orthopaedics and related research</jtitle><addtitle>Clin Orthop Relat Res</addtitle><date>2024-01-01</date><risdate>2024</risdate><volume>482</volume><issue>1</issue><spage>143</spage><epage>157</epage><pages>143-157</pages><issn>0009-921X</issn><issn>1528-1132</issn><eissn>1528-1132</eissn><abstract>The Skeletal Oncology Research Group machine-learning algorithm (SORG-MLA) was developed to predict the survival of patients with spinal metastasis. The algorithm was successfully tested in five international institutions using 1101 patients from different continents. The incorporation of 18 prognostic factors strengthens its predictive ability but limits its clinical utility because some prognostic factors might not be clinically available when a clinician wishes to make a prediction. We performed this study to (1) evaluate the SORG-MLA's performance with data and (2) develop an internet-based application to impute the missing data. A total of 2768 patients were included in this study. The data of 617 patients who were treated surgically were intentionally erased, and the data of the other 2151 patients who were treated with radiotherapy and medical treatment were used to impute the artificially missing data. Compared with those who were treated nonsurgically, patients undergoing surgery were younger (median 59 years [IQR 51 to 67 years] versus median 62 years [IQR 53 to 71 years]) and had a higher proportion of patients with at least three spinal metastatic levels (77% [474 of 617] versus 72% [1547 of 2151]), more neurologic deficit (normal American Spinal Injury Association [E] 68% [301 of 443] versus 79% [1227 of 1561]), higher BMI (23 kg/m 2 [IQR 20 to 25 kg/m 2 ] versus 22 kg/m 2 [IQR 20 to 25 kg/m 2 ]), higher platelet count (240 × 10 3 /µL [IQR 173 to 327 × 10 3 /µL] versus 227 × 10 3 /µL [IQR 165 to 302 × 10 3 /µL], higher lymphocyte count (15 × 10 3 /µL [IQR 9 to 21× 10 3 /µL] versus 14 × 10 3 /µL [IQR 8 to 21 × 10 3 /µL]), lower serum creatinine level (0.7 mg/dL [IQR 0.6 to 0.9 mg/dL] versus 0.8 mg/dL [IQR 0.6 to 1.0 mg/dL]), less previous systemic therapy (19% [115 of 617] versus 24% [526 of 2151]), fewer Charlson comorbidities other than cancer (28% [170 of 617] versus 36% [770 of 2151]), and longer median survival. The two patient groups did not differ in other regards. These findings aligned with our institutional philosophy of selecting patients for surgical intervention based on their level of favorable prognostic factors such as BMI or lymphocyte counts and lower levels of unfavorable prognostic factors such as white blood cell counts or serum creatinine level, as well as the degree of spinal instability and severity of neurologic deficits. This approach aims to identify patients with better survival outcomes and prioritize their surgical intervention accordingly. Seven factors (serum albumin and alkaline phosphatase levels, international normalized ratio, lymphocyte and neutrophil counts, and the presence of visceral or brain metastases) were considered possible missing items based on five previous validation studies and clinical experience. Artificially missing data were imputed using the missForest imputation technique, which was previously applied and successfully tested to fit the SORG-MLA in validation studies. Discrimination, calibration, overall performance, and decision curve analysis were applied to evaluate the SORG-MLA's performance. The discrimination ability was measured with an area under the receiver operating characteristic curve. It ranges from 0.5 to 1.0, with 0.5 indicating the worst discrimination and 1.0 indicating perfect discrimination. An area under the curve of 0.7 is considered clinically acceptable discrimination. Calibration refers to the agreement between the predicted outcomes and actual outcomes. An ideal calibration model will yield predicted survival rates that are congruent with the observed survival rates. The Brier score measures the squared difference between the actual outcome and predicted probability, which captures calibration and discrimination ability simultaneously. A Brier score of 0 indicates perfect prediction, whereas a Brier score of 1 indicates the poorest prediction. A decision curve analysis was performed for the 6-week, 90-day, and 1-year prediction models to evaluate their net benefit across different threshold probabilities. Using the results from our analysis, we developed an internet-based application that facilitates real-time data imputation for clinical decision-making at the point of care. This tool allows healthcare professionals to efficiently and effectively address missing data, ensuring that patient care remains optimal at all times. Generally, the SORG-MLA demonstrated good discriminatory ability, with areas under the curve greater than 0.7 in most cases, and good overall performance, with up to 25% improvement in Brier scores in the presence of one to three missing items. The only exceptions were albumin level and lymphocyte count, because the SORG-MLA's performance was reduced when these two items were missing, indicating that the SORG-MLA might be unreliable without these values. The model tended to underestimate the patient survival rate. As the number of missing items increased, the model's discriminatory ability was progressively impaired, and a marked underestimation of patient survival rates was observed. Specifically, when three items were missing, the number of actual survivors was up to 1.3 times greater than the number of expected survivors, while only 10% discrepancy was observed when only one item was missing. When either two or three items were omitted, the decision curves exhibited substantial overlap, indicating a lack of consistent disparities in performance. This finding suggests that the SORG-MLA consistently generates accurate predictions, regardless of the two or three items that are omitted. We developed an internet application ( https://sorg-spine-mets-missing-data-imputation.azurewebsites.net/ ) that allows the use of SORG-MLA with up to three missing items. The SORG-MLA generally performed well in the presence of one to three missing items, except for serum albumin level and lymphocyte count (which are essential for adequate predictions, even using our modified version of the SORG-MLA). We recommend that future studies should develop prediction models that allow for their use when there are missing data, or provide a means to impute those missing data, because some data are not available at the time a clinical decision must be made. The results suggested the algorithm could be helpful when a radiologic evaluation owing to a lengthy waiting period cannot be performed in time, especially in situations when an early operation could be beneficial. It could help orthopaedic surgeons to decide whether to intervene palliatively or extensively, even when the surgical indication is clear.</abstract><cop>United States</cop><pub>Wolters Kluwer</pub><pmid>37306629</pmid><doi>10.1097/CORR.0000000000002706</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-3123-3224</orcidid><orcidid>https://orcid.org/0000-0002-5752-7917</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0009-921X
ispartof Clinical orthopaedics and related research, 2024-01, Vol.482 (1), p.143-157
issn 0009-921X
1528-1132
1528-1132
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10723864
source PubMed Central
subjects Aged
Albumin
Algorithms
Alkaline phosphatase
Calibration
Cell number
Clinical Research
Comorbidity
Creatinine
Decision making
Decision Support Techniques
Female
Humans
Internet
Learning algorithms
Leukocytes (neutrophilic)
Lymphocytes
Machine Learning
Male
Medical prognosis
Medical treatment
Metastases
Metastasis
Middle Aged
Missing data
Patients
Prediction models
Predictive Value of Tests
Prognosis
Radiation therapy
Retrospective Studies
Risk Factors
Spinal Neoplasms - mortality
Spinal Neoplasms - secondary
Spinal Neoplasms - surgery
Spinal Neoplasms - therapy
Validation studies
title Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T22%3A44%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Does%20the%20Presence%20of%20Missing%20Data%20Affect%20the%20Performance%20of%20the%20SORG%20Machine-learning%20Algorithm%20for%20Patients%20With%20Spinal%20Metastasis?%20Development%20of%20an%20Internet%20Application%20Algorithm&rft.jtitle=Clinical%20orthopaedics%20and%20related%20research&rft.au=Huang,%20Chi-Ching&rft.date=2024-01-01&rft.volume=482&rft.issue=1&rft.spage=143&rft.epage=157&rft.pages=143-157&rft.issn=0009-921X&rft.eissn=1528-1132&rft_id=info:doi/10.1097/CORR.0000000000002706&rft_dat=%3Cproquest_pubme%3E2904232175%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3867-6ef70a8dcafb56793f7228b386ceceb92e3bf4e45404b2b87e4058a7488696473%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2904232175&rft_id=info:pmid/37306629&rfr_iscdi=true