Score: 0

Optimal Transport Based Testing in Factorial Design

Published: September 17, 2025 | arXiv ID: 2509.13970v1

By: Michel Groppe , Linus Niemöller , Shayan Hundrieser and more

Potential Business Impact:

Tests if groups of data are different.

Business Areas:
A/B Testing Data and Analytics

We introduce a general framework for testing statistical hypotheses for probability measures supported on finite spaces, which is based on optimal transport (OT). These tests are inspired by the analysis of variance (ANOVA) and its nonparametric counterparts. They allow for testing linear relationships in factorial designs between discrete probability measures and are based on pairwise comparisons of the OT distance and corresponding barycenters. To this end, we derive under the null hypotheses and (local) alternatives the asymptotic distribution of empirical OT costs and the empirical OT barycenter cost functional as the optimal value of linear programs with random objective function. In particular, we extend existing techniques for probability to signed measures and show directional Hadamard differentiability and the validity of the functional delta method. We discuss computational issues, permutation and bootstrap tests, and back up our findings with simulations. We illustrate our methodology on two datasets from cellular biophysics and biometric identification.

Country of Origin
🇩🇪 Germany

Page Count
45 pages

Category
Mathematics:
Statistics Theory