Decoding Polar Codes with Reinforcement Learning

Nghia Doan; Seyyed Ali Hashemi; Warren Gross

arXiv:2009.06796·cs.IT·September 16, 2020

Decoding Polar Codes with Reinforcement Learning

Nghia Doan, Seyyed Ali Hashemi, Warren Gross

PDF

TL;DR

This paper introduces a reinforcement learning-based method to optimize factor-graph permutations in polar codes, significantly enhancing decoding performance by learning during the decoding process.

Contribution

It formalizes factor-graph permutation selection as a multi-armed bandit problem and develops an online-learning decoder for improved error correction in polar codes.

Findings

01

Achieves 0.125 dB gain at FER of 10^{-4} for 5G polar codes

02

Uses reinforcement learning to adapt permutation selection during decoding

03

Outperforms random permutation selection methods

Abstract

In this paper we address the problem of selecting factor-graph permutations of polar codes under belief propagation (BP) decoding to significantly improve the error-correction performance of the code. In particular, we formalize the factor-graph permutation selection as the multi-armed bandit problem in reinforcement learning and propose a decoder that acts like an online-learning agent that learns to select the good factor-graph permutations during the course of decoding. We use state-of-the-art algorithms for the multi-armed bandit problem and show that for a 5G polar codes of length 128 with 64 information bits, the proposed decoder has an error-correction performance gain of around 0.125 dB at the target frame error rate of 10^{-4}, when compared to the approach that randomly selects the factor-graph permutations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.