Statistical Robustness of Interval CVaR Based Regression Models under Perturbation and Contamination
By: Yulei You, Junyi Liu
Potential Business Impact:
Makes computer learning ignore bad data points.
Robustness under perturbation and contamination is a prominent issue in statistical learning. We address robust nonlinear regression based on the so-called interval conditional value-at-risk (In-CVaR), which was introduced to enhance robustness by trimming extreme losses. While recent literature shows that In-CVaR based statistical learning is more robust than classical robust regression models, its theoretical robustness analysis for nonlinear regression remains largely unexplored. We rigorously quantify robustness under contamination through a unified study of the distributional breakdown point for a broad class of regression models, including linear, piecewise affine, and neural network models with $\ell_1$, $\ell_2$, and Huber losses. Moreover, we analyze the qualitative robustness of the In-CVaR based estimator under perturbation. We show that, under several mild assumptions, the In-CVaR based estimator is qualitatively robust with respect to the Prokhorov metric if and only if the largest portion of losses is trimmed. Overall, this study analyzes the robustness properties of In-CVaR based nonlinear regression models under both perturbation and contamination, and illustrates the advantages of the In-CVaR risk measure over conditional value-at-risk and expectation for robust regression, both in theory and in numerical experiments.
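To make the trimming idea concrete, here is a minimal sketch of an empirical In-CVaR of a loss sample. The abstract does not state the formula, so the definition used here is an assumption: the average of the empirical quantile function of the losses over the level interval [alpha, beta]. The function name in_cvar and the sample values are hypothetical and chosen only for illustration.

```python
import numpy as np


def in_cvar(losses, alpha, beta):
    """Empirical interval CVaR (assumed definition, not taken from the paper).

    Averages the empirical quantile function of `losses` over the level
    interval [alpha, beta], so losses below the alpha-quantile and above
    the beta-quantile receive zero weight.
    """
    if not 0.0 <= alpha < beta <= 1.0:
        raise ValueError("need 0 <= alpha < beta <= 1")
    z = np.sort(np.asarray(losses, dtype=float))
    n = z.size
    if n == 0:
        raise ValueError("losses must be non-empty")
    # The k-th smallest loss is the empirical quantile on ((k-1)/n, k/n].
    edges = np.arange(n + 1) / n
    lo = np.clip(edges[:-1], alpha, beta)
    hi = np.clip(edges[1:], alpha, beta)
    # Weight of each order statistic = overlap of its level interval
    # with [alpha, beta], normalized so the weights sum to one.
    weights = (hi - lo) / (beta - alpha)
    return float(weights @ z)


if __name__ == "__main__":
    # Hypothetical residual losses with one gross outlier.
    sample = [1.0, 2.0, 3.0, 4.0, 100.0]
    print("expectation (alpha=0, beta=1):  ", in_cvar(sample, 0.0, 1.0))
    print("upper tail (alpha=0.8, beta=1): ", in_cvar(sample, 0.8, 1.0))
    print("trimmed top (alpha=0, beta=0.8):", in_cvar(sample, 0.0, 0.8))
```

Under this assumed definition, setting alpha = 0 and beta = 1 recovers the sample mean, and setting beta = 1 with alpha > 0 gives an upper-tail average in the spirit of CVaR. Only a choice with beta < 1 discards the largest losses, which is how we read the abstract's condition that the largest portion of losses is trimmed.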