Score: 0

Robust Clustered Federated Learning for Heterogeneous High-dimensional Data

Published: October 12, 2025 | arXiv ID: 2510.10576v1

By: Changxin Yang, Zhongyi Zhu, Heng Lian

Potential Business Impact:

Helps computers learn from private data better.

Business Areas:
Crowdsourcing Collaboration

Federated learning has attracted significant attention as a privacy-preserving framework for training personalised models on multi-source heterogeneous data. However, most existing approaches are unable to handle scenarios where subgroup structures coexist alongside within-group heterogeneity. In this paper, we propose a federated learning algorithm that addresses general heterogeneity through adaptive clustering. Specifically, our method partitions tasks into subgroups to address substantial between-group differences while enabling efficient information sharing among similar tasks within each group. Furthermore, we integrate the Huber loss and Iterative Hard Thresholding (IHT) to tackle the challenges of high dimensionality and heavy-tailed distributions. Theoretically, we establish convergence guarantees, derive non-asymptotic error bounds, and provide recovery guarantees for the latent cluster structure. Extensive simulation studies and real-data applications further demonstrate the effectiveness and adaptability of our approach.

Page Count
32 pages

Category
Statistics:
Methodology