Fair Clustering with Clusterlets
By: Mattia Setzu, Riccardo Guidotti
Potential Business Impact:
Makes computer groups fair and easy to find.
Given their widespread usage in the real world, the fairness of clustering methods has become of major interest. Theoretical results on fair clustering show that fairness enjoys transitivity: given a set of small and fair clusters, a trivial centroid-based clustering algorithm yields a fair clustering. Unfortunately, discovering a suitable starting clustering can be computationally expensive, rather complex or arbitrary. In this paper, we propose a set of simple \emph{clusterlet}-based fuzzy clustering algorithms that match single-class clusters, optimizing fair clustering. Matching leverages clusterlet distance, optimizing for classic clustering objectives, while also regularizing for fairness. Empirical results show that simple matching strategies are able to achieve high fairness, and that appropriate parameter tuning allows to achieve high cohesion and low overlap.
Similar Papers
Fair Clustering via Alignment
Machine Learning (CS)
Makes computer groups fair without losing quality.
Towards Fair Representation: Clustering and Consensus
Machine Learning (CS)
Makes groups in data fair for everyone.
Fair Bayesian Model-Based Clustering
Machine Learning (Stat)
Groups data fairly without knowing group count.