Reinforcement Learning for Physical Layer Communications
Philippe Mary, Visa Koivunen, Christophe Moy

TL;DR
This paper provides a comprehensive overview of applying reinforcement learning techniques, including basic theory, algorithms, and practical examples, to optimize the physical layer of wireless communication systems.
Contribution
It introduces the application of RL, DRL, and MAB frameworks specifically tailored for physical layer communication problems, with illustrative examples and modeling insights.
Findings
RL algorithms can optimize wireless physical layer performance
Deep RL enhances decision-making in communication systems
Modeling RL problems aids in designing efficient communication protocols
Abstract
In this chapter, we will give comprehensive examples of applying RL in optimizing the physical layer of wireless communications by defining different class of problems and the possible solutions to handle them. In Section 9.2, we present all the basic theory needed to address a RL problem, i.e. Markov decision process (MDP), Partially observable Markov decision process (POMDP), but also two very important and widely used algorithms for RL, i.e. the Q-learning and SARSA algorithms. We also introduce the deep reinforcement learning (DRL) paradigm and the section ends with an introduction to the multi-armed bandits (MAB) framework. Section 9.3 focuses on some toy examples to illustrate how the basic concepts of RL are employed in communication systems. We present applications extracted from literature with simplified system models using similar notation as in Section 9.2 of this Chapter.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · ICT Impact and Policies
MethodsQ-Learning · Sarsa
