Efficient and scalable clustering of survival curves
By: Nora M. Villanueva, Marta Sestelo, Luis Meira-Machado
Survival analysis encompasses a broad range of methods for analyzing time-to-event data, with one key objective being the comparison of survival curves across groups. Traditional approaches for identifying clusters of survival curves often rely on computationally intensive bootstrap techniques to approximate the null hypothesis distribution. While effective, these methods impose significant computational burdens. In this work, we propose a novel approach that leverages the k-means and log-rank test to efficiently identify and cluster survival curves. Our method eliminates the need for computationally expensive resampling, significantly reducing processing time while maintaining statistical reliability. By systematically evaluating survival curves and determining optimal clusters, the proposed method ensures a practical and scalable alternative for large-scale survival data analysis. Through simulation studies, we demonstrate that our approach achieves results comparable to existing bootstrap-based clustering methods while dramatically improving computational efficiency. These findings suggest that the log-rank-based clustering procedure offers a viable and time-efficient solution for researchers working with multiple survival curves in medical and epidemiological studies.
Similar Papers
Network Meta Analysis of Mean Survival
Applications
Helps doctors pick best medicine using survival data.
Survival Analysis as Imprecise Classification with Trainable Kernels
Machine Learning (CS)
Helps predict when things will happen, even with missing info.
Benchmarking Classical, Machine Learning, and Bayesian Survival Models for Clinical Prediction
Applications
Helps doctors predict patient survival better.