Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems
By: George Stamatelis, Angelos-Nikolaos Kanatas, George C. Alexandropoulos
Potential Business Impact:
Makes radios smarter to share airwaves better.
Multi-Agent Deep Reinforcement Learning (MADRL) has emerged as a powerful tool for optimizing decentralized decision-making systems in complex settings, such as Dynamic Spectrum Access (DSA). However, deploying deep learning models on resource-constrained edge devices remains challenging due to their high computational cost. To address this challenge, we present a novel sparse recurrent multi-agent reinforcement learning (MARL) framework that integrates gradual neural network pruning into the independent-actor, global-critic paradigm. We also introduce a harmonic annealing sparsity scheduler, which achieves performance comparable, and in certain cases superior, to that of standard linear and polynomial pruning schedulers at large sparsities. Our experimental investigation demonstrates that the proposed DSA framework discovers superior policies under diverse training conditions, outperforming conventional DSA approaches, MADRL baselines, and state-of-the-art pruning techniques.
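For readers unfamiliar with gradual pruning, the following is a minimal sketch of the two baseline schedulers the abstract names (linear and polynomial), together with a simple magnitude-based mask. It is an illustration under stated assumptions, not the authors' implementation: the function names, the cubic default exponent, and the flat-list weight representation are all hypothetical choices made here. The abstract does not define the harmonic annealing scheduler, so it is deliberately not reproduced.

```python
# Illustrative sketch only; all names and defaults below are assumptions,
# not taken from the paper.

def _progress(step, total_steps):
    """Fraction of the pruning phase completed, clamped to [0, 1]."""
    return min(max(step / total_steps, 0.0), 1.0)

def linear_sparsity(step, total_steps, final_sparsity, initial_sparsity=0.0):
    """Target sparsity grows at a constant rate from initial to final."""
    frac = _progress(step, total_steps)
    return initial_sparsity + (final_sparsity - initial_sparsity) * frac

def polynomial_sparsity(step, total_steps, final_sparsity,
                        initial_sparsity=0.0, power=3):
    """Target sparsity rises quickly at first, then flattens near the end.

    power=3 is a commonly used cubic form (assumed default here).
    """
    frac = _progress(step, total_steps)
    return final_sparsity + (initial_sparsity - final_sparsity) * (1.0 - frac) ** power

def magnitude_mask(weights, sparsity):
    """Return a keep-mask that drops the smallest-magnitude weights.

    weights: flat list of floats; sparsity: fraction of weights to prune.
    """
    k = int(round(sparsity * len(weights)))
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = set(order[:k])
    return [i not in pruned for i in range(len(weights))]

if __name__ == "__main__":
    for step in (0, 25, 50, 75, 100):
        print(step,
              round(linear_sparsity(step, 100, 0.9), 4),
              round(polynomial_sparsity(step, 100, 0.9), 4))
```

In a gradual-pruning loop, the scheduler would be queried every few training steps and the resulting mask applied to each layer's weights. The polynomial schedule removes most weights early, while the network can still recover, and slows down as the target sparsity is approached.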