Multi-Pass Streaming Lower Bounds for Uniformity Testing
By: Qian Li, Xin Lyu
Potential Business Impact:
Finds hidden patterns in random data streams.
We prove multi-pass streaming lower bounds for uniformity testing over a domain of size $2m$. The tester receives a stream of $n$ i.i.d. samples and must distinguish (i) the uniform distribution on $[2m]$ from (ii) a Paninski-style planted distribution in which, for each pair $(2i-1,2i)$, the probabilities are biased left or right by $\epsilon/2m$. We show that any $\ell$-pass streaming algorithm using space $s$ and achieving constant advantage must satisfy the tradeoff $sn\ell=\tilde{\Omega}(m/\epsilon^2)$. This extends the one-pass lower bound of Diakonikolas, Gouleakis, Kane, and Rao (2019) to multiple passes. Our proof has two components. First, we develop a hybrid argument, inspired by Dinur (2020), that reduces streaming to two-player communication problems. This reduction relies on a new perspective on hardness: we identify the source of hardness as uncertainty in the bias directions, rather than the collision locations. Second, we prove a strong lower bound for a basic two-player communication task, in which Alice and Bob must decide whether two random sign vectors $Y^a,Y^b\in\{\pm 1\}^m$ are independent or identical, yet they cannot observe the signs directly--only noisy local views of each coordinate. Our techniques may be of independent use for other streaming problems with stochastic inputs.
Similar Papers
Tight Bounds for Low-Error Frequency Moment Estimation and the Power of Multiple Passes
Data Structures and Algorithms
Stores data streams more efficiently for better analysis.
A Dichotomy Theorem for Multi-Pass Streaming CSPs
Computational Complexity
Makes computers solve tough problems with less memory.
Uniformity Testing under User-Level Local Privacy
Data Structures and Algorithms
Protects user data while checking data patterns.