Topology-Independent Robustness of the Weighted Mean under Label Poisoning Attacks in Heterogeneous Decentralized Learning
By: Jie Peng , Weiyu Li , Stefan Vlaski and more
Potential Business Impact:
Makes smart systems safer from bad data.
Robustness to malicious attacks is crucial for practical decentralized signal processing and machine learning systems. A typical example of such attacks is label poisoning, meaning that some agents possess corrupted local labels and share models trained on these poisoned data. To defend against malicious attacks, existing works often focus on designing robust aggregators; meanwhile, the weighted mean aggregator is typically considered a simple, vulnerable baseline. This paper analyzes the robustness of decentralized gradient descent under label poisoning attacks, considering both robust and weighted mean aggregators. Theoretical results reveal that the learning errors of robust aggregators depend on the network topology, whereas the performance of weighted mean aggregator is topology-independent. Remarkably, the weighted mean aggregator, although often considered vulnerable, can outperform robust aggregators under sufficient heterogeneity, particularly when: (i) the global contamination rate (i.e., the fraction of poisoned agents for the entire network) is smaller than the local contamination rate (i.e., the maximal fraction of poisoned neighbors for the regular agents); (ii) the network of regular agents is disconnected; or (iii) the network of regular agents is sparse and the local contamination rate is high. Empirical results support our theoretical findings, highlighting the important role of network topology in the robustness to label poisoning attacks.
Similar Papers
Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds
Machine Learning (CS)
Makes AI fair for everyone, not just some.
On Robustness of Linear Classifiers to Targeted Data Poisoning
Machine Learning (CS)
Finds fake data that tricks computer learning.
On Robust Aggregation for Distributed Data
Methodology
Keeps computer data safe while still analyzing it.