Lung Cancer Survival Prediction Using Machine Learning and Statistical Methods
By: Varun Vishwanathan Nair, Victor Miranda Soberanis
Potential Business Impact:
Predicts lung cancer survival better for patients.
Lung cancer remains one of the leading causes of cancer-related mortality, yet most survival models rely only on baseline factors and overlook posttreatment variables that reflect disease progression. To address this gap, we applied Cox Proportional Hazards and Random Survival Forests, integrating baseline features with post-treatment predictors such as progression-free interval (PFI.time) and residual tumor status. The Cox model achieved a concordance index (C-index) of 0.90, while the RSF model reached 0.86, both outperforming previous studies. Beyond statistical gains, the integration of post-treatment variables provides oncologists with more clinically meaningful and reliable survival estimates. This enables improved treatment planning, more personalized patient counseling, and better-informed follow-up strategies. From a practical standpoint, these results demonstrate how routinely collected clinical variables can be transformed into actionable survival predictions.
Similar Papers
Benchmarking Classical, Machine Learning, and Bayesian Survival Models for Clinical Prediction
Applications
Helps doctors predict patient survival better.
A Flexible Partially Linear Single Index Proportional Hazards Regression Model for Multivariate Survival Data
Methodology
Predicts how long people live, even with many factors.
Methodology for Comparing Machine Learning Algorithms for Survival Analysis
Machine Learning (CS)
Helps doctors guess how long cancer patients will live.