Score: 0

Local Search-based Individually Fair Clustering with Outliers

Published: October 7, 2025 | arXiv ID: 2510.06130v1

By: Binita Maity, Shrutimoy Das, Anirban Dasgupta

Potential Business Impact:

Groups data fairly, even with bad data.

Business Areas:
Crowdsourcing Collaboration

In this paper, we present a local search-based algorithm for individually fair clustering in the presence of outliers. We consider the individual fairness definition proposed in Jung et al., which requires that each of the $n$ points in the dataset must have one of the $k$ centers within its $n/k$ nearest neighbors. However, if the dataset is known to contain outliers, the set of fair centers obtained under this definition might be suboptimal for non-outlier points. In order to address this issue, we propose a method that discards a set of points marked as outliers and computes the set of fair centers for the remaining non-outlier points. Our method utilizes a randomized variant of local search, which makes it scalable to large datasets. We also provide an approximation guarantee of our method as well as a bound on the number of outliers discarded. Additionally, we demonstrate our claims experimentally on a set of real-world datasets.

Country of Origin
🇮🇳 India

Page Count
12 pages

Category
Computer Science:
Data Structures and Algorithms