Interpretable Deep Regression Models with Interval-Censored Failure Time Data
By: Changhui Yuan , Shishun Zhao , Shuwei Li and more
Potential Business Impact:
Helps predict disease using smart computer guessing.
Deep neural networks (DNNs) have become powerful tools for modeling complex data structures through sequentially integrating simple functions in each hidden layer. In survival analysis, recent advances of DNNs primarily focus on enhancing model capabilities, especially in exploring nonlinear covariate effects under right censoring. However, deep learning methods for interval-censored data, where the unobservable failure time is only known to lie in an interval, remain underexplored and limited to specific data type or model. This work proposes a general regression framework for interval-censored data with a broad class of partially linear transformation models, where key covariate effects are modeled parametrically while nonlinear effects of nuisance multi-modal covariates are approximated via DNNs, balancing interpretability and flexibility. We employ sieve maximum likelihood estimation by leveraging monotone splines to approximate the cumulative baseline hazard function. To ensure reliable and tractable estimation, we develop an EM algorithm incorporating stochastic gradient descent. We establish the asymptotic properties of parameter estimators and show that the DNN estimator achieves minimax-optimal convergence. Extensive simulations demonstrate superior estimation and prediction accuracy over state-of-the-art methods. Applying our method to the Alzheimer's Disease Neuroimaging Initiative dataset yields novel insights and improved predictive performance compared to traditional approaches.
Similar Papers
Deep learning for interval-censored failure time data from case-cohort studies
Methodology
Finds hidden patterns in health data.
Self-Consistent Equation-guided Neural Networks for Censored Time-to-Event Data
Machine Learning (Stat)
Helps predict when patients might get better.
Flexible Deep Neural Networks for Partially Linear Survival Data
Machine Learning (Stat)
Helps doctors predict how long patients will live.