Fair Clustering in the Sliding Window Model
By: Vincent Cohen-Addad , Shaofeng H. -C. Jiang , Qiaoyuan Yang and more
Potential Business Impact:
Makes computer groups fairer, even with changing data.
We study streaming algorithms for proportionally fair clustering, a notion originally suggested by Chierichetti et. al. (2017), in the sliding window model. We show that although there exist efficient streaming algorithms in the insertion-only model, surprisingly no algorithm can achieve finite multiplicative ratio without violating the fairness constraint in the sliding window. Hence, the problem of fair clustering is a rare separation between the insertion-only streaming model and the sliding window model. On the other hand, we show that if the fairness constraint is relaxed by a multiplicative $(1+\varepsilon)$ factor, there exists a $(1 + \varepsilon)$-approximate sliding window algorithm that uses $\text{poly}(k\varepsilon^{-1}\log n)$ space. This achieves essentially the best parameters (up to degree in the polynomial) provided the aforementioned lower bound. We also implement a number of empirical evaluations on real datasets to complement our theoretical results.
Similar Papers
Fair Center Clustering in Sliding Windows
Data Structures and Algorithms
Finds best spots for groups, fairly, using less space.
Improved Streaming Algorithm for Fair $k$-Center Clustering
Data Structures and Algorithms
Helps computers group data fairly for everyone.
Tight Bounds for Heavy-Hitters and Moment Estimation in the Sliding Window Model
Data Structures and Algorithms
Finds important items in fast-moving data streams.