Bayesian Nonparametric Modelling for Model-Free Reinforcement Learning in LTE-LAA and Wi-Fi Coexistence
Po-Kan Shih, Bahman Moraffah

TL;DR
This paper introduces a Bayesian nonparametric reinforcement learning algorithm for fair and efficient coexistence of LTE-LAA and Wi-Fi in unlicensed spectrum, modeling the problem as a Dec-POMDP and using variational inference for efficiency.
Contribution
It develops a novel nonparametric Bayesian RL approach for decentralized spectrum sharing, addressing uncertainty and fairness in LTE-LAA and Wi-Fi coexistence.
Findings
Achieves high policy value with few learning iterations
Ensures fair spectrum sharing between LTE-LAA and Wi-Fi
Provides computationally efficient inference via variational methods
Abstract
With the arrival of next generation wireless communication, a growing number of new applications like internet of things, autonomous driving systems, and drone are crowding the unlicensed spectrum. Licensed network such as the long-term evolution (LTE) also comes to the unlicensed spectrum for better providing high-capacity contents with low cost. However, LTE was not designed to share resources with others. Previous solutions usually work on fixed scenarios. This work features a Nonparametric Bayesian reinforcement learning algorithm to cope with the coexistence between Wi-Fi and LTE licensed assisted access (LTE-LAA) agents in 5 GHz unlicensed spectrum. The coexistence problem is modeled as a decentralized partially-observable Markov decision process (Dec-POMDP) and Bayesian inference is adopted for policy learning with nonparametric prior to accommodate the uncertainty of policy for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · Advanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms
MethodsVariational Inference
