Score: 2

Fair Clustering in the Sliding Window Model

Published: March 7, 2025 | arXiv ID: 2503.05173v1

By: Vincent Cohen-Addad , Shaofeng H. -C. Jiang , Qiaoyuan Yang and more

BigTech Affiliations: Google

Potential Business Impact:

Makes computer groups fairer, even with changing data.

Business Areas:
A/B Testing Data and Analytics

We study streaming algorithms for proportionally fair clustering, a notion originally suggested by Chierichetti et. al. (2017), in the sliding window model. We show that although there exist efficient streaming algorithms in the insertion-only model, surprisingly no algorithm can achieve finite multiplicative ratio without violating the fairness constraint in the sliding window. Hence, the problem of fair clustering is a rare separation between the insertion-only streaming model and the sliding window model. On the other hand, we show that if the fairness constraint is relaxed by a multiplicative $(1+\varepsilon)$ factor, there exists a $(1 + \varepsilon)$-approximate sliding window algorithm that uses $\text{poly}(k\varepsilon^{-1}\log n)$ space. This achieves essentially the best parameters (up to degree in the polynomial) provided the aforementioned lower bound. We also implement a number of empirical evaluations on real datasets to complement our theoretical results.

Country of Origin
πŸ‡¨πŸ‡³ πŸ‡ΊπŸ‡Έ United States, China

Page Count
35 pages

Category
Computer Science:
Data Structures and Algorithms