Depth based trimmed means
By: Alejandro Cholaquidis , Ricardo Fraiman , Leonardo Moreno and more
Potential Business Impact:
Finds the true center of messy data.
Robust estimation of location is a fundamental problem in statistics, particularly in scenarios where data contamination by outliers or model misspecification is a concern. In univariate settings, methods such as the sample median and trimmed means balance robustness and efficiency by mitigating the influence of extreme observations. This paper extends these robust techniques to the multivariate context through the use of data depth functions, which provide a natural means to order and rank multidimensional data. We review several depth measures and discuss their role in generalizing trimmed mean estimators beyond one dimension. Our main contributions are twofold: first, we prove the almost sure consistency of the multivariate trimmed mean estimator under mixing conditions; second, we establish a general limit distribution theorem for a broad family of depth-based estimators, encompassing popular examples such as Tukey's and projection depth. These theoretical advancements not only enhance the understanding of robust location estimation in high-dimensional settings but also offer practical guidelines for applications in areas such as machine learning, economic analysis, and financial risk assessment. A small example with simulated data is performed, varying the depth measure used and the percentage of trimmed data.
Similar Papers
Some insights into depth estimators for location and scatter in the multivariate setting
Statistics Theory
Finds the most reliable data, ignoring bad information.
Data Depth as a Risk
Machine Learning (Stat)
Finds weird data points using simple math.
Asymptotic linear dependence and ellipse statistics for multivariate two-sample homogeneity test
Methodology
Finds if two groups of data are different.