Multilevel Sampling in Algebraic Statistics
By: Nathan Kirk, Ivan Gvozdanović, Sonja Petrović
Potential Business Impact:
Helps computers explore tricky math problems faster.
This paper proposes a multilevel sampling algorithm for fiber sampling problems in algebraic statistics, inspired by Henry Wynn's suggestion to adapt multilevel Monte Carlo (MLMC) ideas to discrete models. Focusing on log-linear models, we sample from high-dimensional lattice fibers defined by algebraic constraints. Building on Markov basis methods and results from Diaconis and Sturmfels, our algorithm uses variable step sizes to accelerate exploration and reduce the need for long burn-in. We introduce a novel Fiber Coverage Score (FCS) based on Voronoi partitioning to assess sample quality, and highlight the utility of the Maximum Mean Discrepancy (MMD) quality metric. Simulations on benchmark fibers show that multilevel sampling outperforms naive MCMC approaches. Our results demonstrate that multilevel methods, when properly applied, provide practical benefits for discrete sampling in algebraic statistics.
Similar Papers
A Framework to Analyze Multiscale Sampling MCMC Methods
Methodology
Helps computers solve tricky math problems faster.
Exploiting Inexact Computations in Multilevel Sampling Methods
Numerical Analysis
Makes computer simulations run much faster.
Multilevel Monte Carlo Metamodeling for Variance Function Estimation
Methodology
Makes computer models run faster and more accurately.