Reinforcement Learning for Physical Layer Communications

Philippe Mary; Visa Koivunen; Christophe Moy

arXiv:2106.11595·cs.AI·July 2, 2021

Reinforcement Learning for Physical Layer Communications

Philippe Mary, Visa Koivunen, Christophe Moy

PDF

Open Access 1 Repo

TL;DR

This paper provides a comprehensive overview of applying reinforcement learning techniques, including basic theory, algorithms, and practical examples, to optimize the physical layer of wireless communication systems.

Contribution

It introduces the application of RL, DRL, and MAB frameworks specifically tailored for physical layer communication problems, with illustrative examples and modeling insights.

Findings

01

RL algorithms can optimize wireless physical layer performance

02

Deep RL enhances decision-making in communication systems

03

Modeling RL problems aids in designing efficient communication protocols

Abstract

In this chapter, we will give comprehensive examples of applying RL in optimizing the physical layer of wireless communications by defining different class of problems and the possible solutions to handle them. In Section 9.2, we present all the basic theory needed to address a RL problem, i.e. Markov decision process (MDP), Partially observable Markov decision process (POMDP), but also two very important and widely used algorithms for RL, i.e. the Q-learning and SARSA algorithms. We also introduce the deep reinforcement learning (DRL) paradigm and the section ends with an introduction to the multi-armed bandits (MAB) framework. Section 9.3 focuses on some toy examples to illustrate how the basic concepts of RL are employed in communication systems. We present applications extracted from literature with simplified system models using similar notation as in Section 9.2 of this Chapter.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://bitbucket.org/scee_ietr/malin-multi-armed-bandit-learning-for-iot-networks-with-grc
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced MIMO Systems Optimization · ICT Impact and Policies

MethodsQ-Learning · Sarsa