Score: 1

Learning Survival Models with Right-Censored Reporting Delays

Published: October 6, 2025 | arXiv ID: 2510.04421v1

By: Yuta Shikuri, Hironori Fujisawa

Potential Business Impact:

Helps insurance know risks faster for new people.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Survival analysis is a statistical technique used to estimate the time until an event occurs. Although it is applied across a wide range of fields, adjusting for reporting delays under practical constraints remains a significant challenge in the insurance industry. Such delays render event occurrences unobservable when their reports are subject to right censoring. This issue becomes particularly critical when estimating hazard rates for newly enrolled cohorts with limited follow-up due to administrative censoring. Our study addresses this challenge by jointly modeling the parametric hazard functions of event occurrences and report timings. The joint probability distribution is marginalized over the latent event occurrence status. We construct an estimator for the proposed survival model and establish its asymptotic consistency. Furthermore, we develop an expectation-maximization algorithm to compute its estimates. Using these findings, we propose a two-stage estimation procedure based on a parametric proportional hazards model to evaluate observations subject to administrative censoring. Experimental results demonstrate that our method effectively improves the timeliness of risk evaluation for newly enrolled cohorts.

Repos / Data Links

Page Count
21 pages

Category
Statistics:
Machine Learning (Stat)