Local Search-based Individually Fair Clustering with Outliers
By: Binita Maity, Shrutimoy Das, Anirban Dasgupta
Potential Business Impact:
Groups data fairly, even with bad data.
In this paper, we present a local search-based algorithm for individually fair clustering in the presence of outliers. We consider the individual fairness definition proposed in Jung et al., which requires that each of the $n$ points in the dataset must have one of the $k$ centers within its $n/k$ nearest neighbors. However, if the dataset is known to contain outliers, the set of fair centers obtained under this definition might be suboptimal for non-outlier points. In order to address this issue, we propose a method that discards a set of points marked as outliers and computes the set of fair centers for the remaining non-outlier points. Our method utilizes a randomized variant of local search, which makes it scalable to large datasets. We also provide an approximation guarantee of our method as well as a bound on the number of outliers discarded. Additionally, we demonstrate our claims experimentally on a set of real-world datasets.
Similar Papers
Improved Streaming Algorithm for Fair $k$-Center Clustering
Data Structures and Algorithms
Helps computers group data fairly for everyone.
Fair Center Clustering in Sliding Windows
Data Structures and Algorithms
Finds best spots for groups, fairly, using less space.
Euclidean k-center Fair Clusterings
Computational Geometry
Divides groups fairly, ensuring representation limits.