Distributed Learning in Markovian Restless Bandits over Interference Graphs for Stable Spectrum Sharing
By: Liad Lea Didi, Kobi Cohen
We study distributed learning for spectrum access and sharing among multiple cognitive communication entities, such as cells, subnetworks, or cognitive radio users (collectively referred to as cells), in communication-constrained wireless networks modeled by interference graphs. Our goal is to achieve a globally stable and interference-aware channel allocation. Stability is defined through a generalized Gale-Shapley multi-to-one matching, a well-established solution concept in wireless resource allocation. We consider wireless networks where L cells share S orthogonal channels and cannot simultaneously use the same channel as their neighbors. Each channel evolves as an unknown restless Markov process with cell-dependent rewards, making this the first work to establish global Gale-Shapley stability for channel allocation in a stochastic, temporally varying restless environment. To address this challenge, we develop SMILE (Stable Multi-matching with Interference-aware LEarning), a communication-efficient distributed learning algorithm that integrates restless bandit learning with graph-constrained coordination. SMILE enables cells to distributedly balance exploration of unknown channels with exploitation of learned information. We prove that SMILE converges to the optimal stable allocation and achieves logarithmic regret relative to a genie with full knowledge of expected utilities. Simulations validate the theoretical guarantees and demonstrate SMILE's robustness, scalability, and efficiency across diverse spectrum-sharing scenarios.
Similar Papers
Distributed Learning for Reliable and Timely Communication in 6G Industrial Subnetworks
Networking and Internet Architecture
Helps machines talk faster without crashing.
Distributed resource allocation in cognitive radio networks with a game learning approach to improve aggregate system capacity
Networking and Internet Architecture
Lets radios share airwaves smartly.
Learning-Based Channel Access in Wi-Fi: A Multi-Armed Bandit Approach
Networking and Internet Architecture
Makes Wi-Fi faster by learning how to share.