Robust and Sparse Estimation of Unbounded Density Ratio under Heavy Contamination
By: Ryosuke Nagumo, Hironori Fujisawa
Potential Business Impact:
Finds bad data in computer learning.
We examine the non-asymptotic properties of robust density ratio estimation (DRE) in contaminated settings. Weighted DRE is the most promising among existing methods, exhibiting doubly strong robustness from an asymptotic perspective. This study demonstrates that Weighted DRE achieves sparse consistency even under heavy contamination within a non-asymptotic framework. This method addresses two significant challenges in density ratio estimation and robust estimation. For density ratio estimation, we provide the non-asymptotic properties of estimating unbounded density ratios under the assumption that the weighted density ratio function is bounded. For robust estimation, we introduce a non-asymptotic framework for doubly strong robustness under heavy contamination, assuming that at least one of the following conditions holds: (i) contamination ratios are small, and (ii) outliers have small weighted values. This work provides the first non-asymptotic analysis of strong robustness under heavy contamination.
Similar Papers
Projection Pursuit Density Ratio Estimation
Machine Learning (Stat)
Finds patterns in complex data better.
Estimating Unbounded Density Ratios: Applications in Error Control under Covariate Shift
Machine Learning (Stat)
Makes computer learning better with tricky data.
Doubly robust integration of nonprobability and probability survey data
Methodology
Combines survey data for more accurate results.