Human Versus Machine in Survival Prediction for Metastatic Spinal Cord Compression: A Retrospective Cohort Study
Rodrigo Muscogliati1, Elie Najjar1, Vaishnavi Sharma1, Khalid Salem1
1Centre for Spinal Studies and Surgery, Queens Medical Centre, Nottingham University Hospitals NHS Trust, Nottingham, United Kingdom
Objective:
To compare the accuracy of oncologist judgment, surgeon-calculated Tokuhashi scores, and ChatGPT-assisted predictions in estimating survival outcomes in metastatic spinal cord compression (MSCC) patients.
Background:
Accurate survival prediction in metastatic spinal cord compression (MSCC) is critical for guiding treatment decisions, yet remains challenging, particularly for intermediate survival durations. There is ambiguity surrounding the most accurate and validated method in estimating prognosis in MSCC patients. We aimed to address this uncertainty in our study.
Design/Methods:
This retrospective study included 99 consecutive patients referred to a tertiary spinal oncology center with radiologically confirmed MSCC. Anonymized clinical data were used to calculate surgeon Tokuhashi scores, document oncologist-estimated life expectancy, and generate ChatGPT-assisted survival predictions based on both literature review and Tokuhashi calculation. Predictions were compared against actual survival outcomes (<6 months, 6–12 months, >12 months). Machine learning analyses identified key predictors of survival.
Results:
Overall prediction accuracies were 54% for ChatGPT Tokuhashi-based predictions, 50% for surgeon Tokuhashi scores, 48% for oncologist judgment, and 37% for ChatGPT literature-based estimates. Recall for short survival (<6 months) was highest with surgeon (71%) and ChatGPT Tokuhashi (69%) methods, whereas intermediate survival (6–12 months) remained difficult to predict across all modalities. Functional status (Karnofsky score) and patient age emerged as the strongest survival predictors across logistic regression, random forest, decision tree, and XGBoost models, surpassing primary tumor type and metastasis burden.
Conclusions:
Structured prognostic tools and AI-assisted scoring can complement clinical judgment in predicting short-term survival in MSCC. However, intermediate-term survival prediction remains a critical unmet need. Future prognostic strategies should prioritize dynamic functional metrics over static tumor classifications to improve personalized decision-making.
Disclaimer: Abstracts were not reviewed by Neurology® and do not reflect the views of Neurology® editors or staff.