Score: 0

Depth based trimmed means

Published: May 6, 2025 | arXiv ID: 2505.03523v1

By: Alejandro Cholaquidis , Ricardo Fraiman , Leonardo Moreno and more

Potential Business Impact:

Finds the true center of messy data.

Business Areas:

Analytics Data and Analytics

Robust estimation of location is a fundamental problem in statistics, particularly in scenarios where data contamination by outliers or model misspecification is a concern. In univariate settings, methods such as the sample median and trimmed means balance robustness and efficiency by mitigating the influence of extreme observations. This paper extends these robust techniques to the multivariate context through the use of data depth functions, which provide a natural means to order and rank multidimensional data. We review several depth measures and discuss their role in generalizing trimmed mean estimators beyond one dimension. Our main contributions are twofold: first, we prove the almost sure consistency of the multivariate trimmed mean estimator under mixing conditions; second, we establish a general limit distribution theorem for a broad family of depth-based estimators, encompassing popular examples such as Tukey's and projection depth. These theoretical advancements not only enhance the understanding of robust location estimation in high-dimensional settings but also offer practical guidelines for applications in areas such as machine learning, economic analysis, and financial risk assessment. A small example with simulated data is performed, varying the depth measure used and the percentage of trimmed data.

Some insights into depth estimators for location and scatter in the multivariate setting

Statistics Theory

Finds the most reliable data, ignoring bad information.

12 May 2025 0

87%

Data Depth as a Risk

Machine Learning (Stat)

Finds weird data points using simple math.

11 Jul 2025 0

87%

Asymptotic linear dependence and ellipse statistics for multivariate two-sample homogeneity test

Methodology

Finds if two groups of data are different.

24 Nov 2025 0

View PDF Login to Bookmark

Page Count

23 pages

Depth based trimmed means

Finds the true center of messy data.

Technical Abstract

Some insights into depth estimators for location and scatter in the multivariate setting

Data Depth as a Risk

Asymptotic linear dependence and ellipse statistics for multivariate two-sample homogeneity test