Distributed Learning in Markovian Restless Bandits over Interference Graphs for Stable Spectrum Sharing

Liad Lea Didi; Kobi Cohen

arXiv:2512.17161·cs.LG·December 22, 2025

Distributed Learning in Markovian Restless Bandits over Interference Graphs for Stable Spectrum Sharing

Liad Lea Didi, Kobi Cohen

PDF

Open Access

TL;DR

This paper introduces SMILE, a distributed learning algorithm for spectrum sharing in wireless networks modeled by interference graphs, achieving stable, interference-aware channel allocation with proven convergence and low regret.

Contribution

It is the first to establish global Gale-Shapley stability in a stochastic restless environment and develops a communication-efficient algorithm combining restless bandit learning with graph coordination.

Findings

01

SMILE converges to the optimal stable allocation.

02

SMILE achieves logarithmic regret compared to full-knowledge benchmarks.

03

Simulations confirm robustness, scalability, and efficiency of SMILE.

Abstract

We study distributed learning for spectrum access and sharing among multiple cognitive communication entities, such as cells, subnetworks, or cognitive radio users (collectively referred to as cells), in communication-constrained wireless networks modeled by interference graphs. Our goal is to achieve a globally stable and interference-aware channel allocation. Stability is defined through a generalized Gale-Shapley multi-to-one matching, a well-established solution concept in wireless resource allocation. We consider wireless networks where L cells share S orthogonal channels and cannot simultaneously use the same channel as their neighbors. Each channel evolves as an unknown restless Markov process with cell-dependent rewards, making this the first work to establish global Gale-Shapley stability for channel allocation in a stochastic, temporally varying restless environment. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Cognitive Radio Networks and Spectrum Sensing · Age of Information Optimization