Reinforcement Learning for Sequential Decoding of Generalized LDPC Codes
Salman Habib, David G. M. Mitchell

TL;DR
This paper introduces a reinforcement learning approach for sequential decoding of generalized LDPC codes, optimizing scheduling policies to improve decoding performance over traditional belief propagation methods.
Contribution
It models the decoding process as an MDP and trains an RL agent to optimize scheduling, achieving better decoding performance than standard methods.
Findings
RL-based decoder outperforms standard BP flooding decoder
RL scheduling reduces decoding complexity
Significant performance gains over random scheduling
Abstract
In this work, we propose reinforcement learning (RL) for sequential decoding of moderate length generalized low-density parity-check (GLDPC) codes. Here, sequential decoding refers to scheduling all the generalized constraint nodes (GCNs) and single parity-check nodes (SPCNs) of a GLDPC code serially in each iteration. A GLDPC decoding environment is modeled as a finite Markov decision process (MDP) in which the state-space comprises of all possible sequences of hard-decision values of the variables nodes (VNs) connected to the scheduled GCN or SPCN, and the action-space of the MDP consists of all possible actions (GCN and SPCN scheduling). The goal of RL is to determine an optimized scheduling policy, i.e., one that results in a decoded codeword by minimizing the complexity of the belief propagation (BP) decoder. For training, we consider the proportion of correct bits at the output of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsError Correcting Code Techniques · Advanced Wireless Communication Techniques
