Score: 0

Robust and Sparse Estimation of Unbounded Density Ratio under Heavy Contamination

Published: December 10, 2025 | arXiv ID: 2512.09266v1

By: Ryosuke Nagumo, Hironori Fujisawa

Potential Business Impact:

Finds bad data in computer learning.

Business Areas:
A/B Testing Data and Analytics

We examine the non-asymptotic properties of robust density ratio estimation (DRE) in contaminated settings. Weighted DRE is the most promising among existing methods, exhibiting doubly strong robustness from an asymptotic perspective. This study demonstrates that Weighted DRE achieves sparse consistency even under heavy contamination within a non-asymptotic framework. This method addresses two significant challenges in density ratio estimation and robust estimation. For density ratio estimation, we provide the non-asymptotic properties of estimating unbounded density ratios under the assumption that the weighted density ratio function is bounded. For robust estimation, we introduce a non-asymptotic framework for doubly strong robustness under heavy contamination, assuming that at least one of the following conditions holds: (i) contamination ratios are small, and (ii) outliers have small weighted values. This work provides the first non-asymptotic analysis of strong robustness under heavy contamination.

Page Count
38 pages

Category
Statistics:
Machine Learning (Stat)