Score: 0

Statistical Robustness of Interval CVaR Based Regression Models under Perturbation and Contamination

Published: January 16, 2026 | arXiv ID: 2601.11420v1

By: Yulei You, Junyi Liu

Potential Business Impact:

Makes computer learning ignore bad data points.

Business Areas:

A/B Testing Data and Analytics

Robustness under perturbation and contamination is a prominent issue in statistical learning. We address the robust nonlinear regression based on the so-called interval conditional value-at-risk (In-CVaR), which is introduced to enhance robustness by trimming extreme losses. While recent literature shows that the In-CVaR based statistical learning exhibits superior robustness performance than classical robust regression models, its theoretical robustness analysis for nonlinear regression remains largely unexplored. We rigorously quantify robustness under contamination, with a unified study of distributional breakdown point for a broad class of regression models, including linear, piecewise affine and neural network models with $\ell_1$, $\ell_2$ and Huber losses. Moreover, we analyze the qualitative robustness of the In-CVaR based estimator under perturbation. We show that under several minor assumptions, the In-CVaR based estimator is qualitatively robust in terms of the Prokhorov metric if and only if the largest portion of losses is trimmed. Overall, this study analyzes robustness properties of In-CVaR based nonlinear regression models under both perturbation and contamination, which illustrates the advantages of In-CVaR risk measure over conditional value-at-risk and expectation for robust regression in both theory and numerical experiments.

Confidence Intervals for Linear Models with Arbitrary Noise Contamination

Statistics Theory

Finds reliable answers even with bad data.

10 Nov 2025 0

88%

Estimation of the Coefficient of Variation of Weibull Distribution under Type-I Progressively Interval Censoring: A Simulation-based Approach

Methodology

Helps predict when things will break.

20 Nov 2025 0

88%

On Design of Representative Distributionally Robust Formulations for Evaluation of Tail Risk Measures

Risk Management

Finds the worst possible money loss safely.

19 Jun 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

49 pages

Statistical Robustness of Interval CVaR Based Regression Models under Perturbation and Contamination

Makes computer learning ignore bad data points.

Technical Abstract

Confidence Intervals for Linear Models with Arbitrary Noise Contamination

Estimation of the Coefficient of Variation of Weibull Distribution under Type-I Progressively Interval Censoring: A Simulation-based Approach

On Design of Representative Distributionally Robust Formulations for Evaluation of Tail Risk Measures