Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning
By: Bahman Abolhassani , Tugba Erpek , Kemal Davaslioglu and more
Reactive jammers pose a severe security threat to robotic-swarm networks by selectively disrupting inter-agent communications and undermining formation integrity and mission success. Conventional countermeasures such as fixed power control or static channel hopping are largely ineffective against such adaptive adversaries. This paper presents a multi-agent reinforcement learning (MARL) framework based on the QMIX algorithm to improve the resilience of swarm communications under reactive jamming. We consider a network of multiple transmitter-receiver pairs sharing channels while a reactive jammer with Markovian threshold dynamics senses aggregate power and reacts accordingly. Each agent jointly selects transmit frequency (channel) and power, and QMIX learns a centralized but factorizable action-value function that enables coordinated yet decentralized execution. We benchmark QMIX against a genie-aided optimal policy in a no-channel-reuse setting, and against local Upper Confidence Bound (UCB) and a stateless reactive policy in a more general fading regime with channel reuse enabled. Simulation results show that QMIX rapidly converges to cooperative policies that nearly match the genie-aided bound, while achieving higher throughput and lower jamming incidence than the baselines, thereby demonstrating MARL's effectiveness for securing autonomous swarms in contested environments.
Similar Papers
Multi-Agent Deep Reinforcement Learning for Collaborative UAV Relay Networks under Jamming Atatcks
Networking and Internet Architecture
Drones learn to talk without crashing or jamming.
How to Combat Reactive and Dynamic Jamming Attacks with Reinforcement Learning
Machine Learning (CS)
Learns to send messages even when jammed.
Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies
Systems and Control
Drones learn to deliver packages without crashing.