
Beyond Cox Models: Assessing the Performance of Machine-Learning Methods in Non-Proportional Hazards and Non-Linear Survival Analysis

Published: April 24, 2025 | arXiv ID: 2504.17568v2

By: Ivan Rossi, Flavio Sartori, Cesare Rollo and more

Potential Business Impact:

Finds better ways to predict when people will get sick.

Business Areas:

Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Survival analysis often relies on Cox models, which assume both linearity and proportional hazards (PH). This study evaluates machine and deep learning methods that relax these constraints, comparing their performance with penalized Cox models on a benchmark of three synthetic and three real datasets. In total, eight different models were tested, including six non-linear models, of which four were also non-PH. Although Cox regression often yielded satisfactory performance, we showed the conditions under which machine and deep learning models can perform better. Indeed, the performance of these methods has often been underestimated due to the improper use of Harrell's concordance index (C-index) instead of more appropriate scores such as Antolini's concordance index, which generalizes the C-index to cases where the PH assumption does not hold. In addition, since models with a high C-index are occasionally badly calibrated, combining Antolini's C-index with the Brier score is useful to assess the overall performance of a survival method. Results on our benchmark data showed that survival prediction should be approached by testing different methods and selecting the most appropriate one according to sample size, non-linearity and non-PH conditions. To make these tests on our benchmark data easy to reproduce, code and documentation are freely available at https://github.com/compbiomed-unito/survhive.
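
The difference between the two concordance indices named in the abstract can be illustrated with a minimal sketch. Harrell's C-index ranks subjects by a single time-independent risk score, while Antolini's index compares the predicted survival probabilities of both subjects at the earlier event time, so it stays meaningful when survival curves cross. The function names, the toy data, and the exponential survival curves below are assumptions made for illustration only; this is not the SurvHive API or the authors' implementation.

```python
from math import exp
from typing import Callable, Sequence


def _pairwise_concordance(
    times: Sequence[float],
    events: Sequence[int],
    score: Callable[[int, int], float],
) -> float:
    """Average score(i, j) over comparable pairs where subject i has an
    observed event strictly before subject j's time."""
    total, comparable = 0.0, 0
    for i in range(len(times)):
        if not events[i]:
            continue  # a censored subject cannot be the earlier member of a pair
        for j in range(len(times)):
            if times[i] < times[j]:
                comparable += 1
                total += score(i, j)
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return total / comparable


def _compare(a: float, b: float) -> float:
    """1.0 if a > b, 0.5 on a tie, 0.0 otherwise."""
    return 1.0 if a > b else 0.5 if a == b else 0.0


def harrell_c_index(times, events, risk):
    """Concordant when the subject failing first has the higher risk score."""
    return _pairwise_concordance(
        times, events, lambda i, j: _compare(risk[i], risk[j])
    )


def antolini_c_index(times, events, surv):
    """surv(k, t) is the predicted S(t | x_k). Concordant when the subject
    failing first has the lower predicted survival at its own event time."""
    return _pairwise_concordance(
        times, events,
        lambda i, j: _compare(surv(j, times[i]), surv(i, times[i])),
    )


# Hypothetical toy data: four subjects, the third one is censored.
times = [1.0, 2.0, 3.0, 4.0]
events = [1, 1, 0, 1]
hazard = [4.0, 3.0, 2.0, 1.0]  # assumed constant hazards, so PH holds

c_h = harrell_c_index(times, events, hazard)
c_a = antolini_c_index(times, events, lambda k, t: exp(-hazard[k] * t))
print(c_h, c_a)
```

Under proportional hazards, as in this toy case, the ordering of survival probabilities is the same at every time point, so the two indices agree. They can diverge only when predicted survival curves cross, which is the situation the abstract says Harrell's index handles improperly.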

Comprehensive Benchmarking of Machine Learning Methods for Risk Prediction Modelling from Large-Scale Survival Data: A UK Biobank Study

Machine Learning (CS)

Finds best computer models to predict health risks.

11 Mar 2025 0

88%

Extending Cox Proportional Hazards Model with Symbolic Non-Linear Log-Risk Functions for Survival Analysis

Machine Learning (CS)

Makes predicting life spans easier and clearer.

6 Apr 2025 2

88%

A Flexible Partially Linear Single Index Proportional Hazards Regression Model for Multivariate Survival Data

Methodology

Predicts how long people live, even with many factors.

15 Oct 2025 0


Country of Origin

🇮🇹 Italy

Repos / Data Links

github.com

Page Count

16 pages
