Reinforcement Learning Evaluation and Solution for the Feedback Capacity of the Ising Channel with Large Alphabet
Ziv Aharoni, Oron Sabag, Haim Henri Permuter

TL;DR
This paper introduces a reinforcement learning approach to evaluate and approximate the feedback capacity of the Ising channel with large alphabets, overcoming computational challenges of traditional methods.
Contribution
It develops a neural network-based RL method to estimate feedback capacity for channels with large alphabets and derives analytic bounds and coding schemes from the structure of solutions.
Findings
RL provides a numerical lower bound on feedback capacity.
Analytic solutions are derived for alphabet sizes up to 8.
Asymptotic coding schemes are proposed for large alphabets.
Abstract
We propose a new method to compute the feedback capacity of unifilar finite state channels (FSCs) with memory using reinforcement learning (RL). The feedback capacity was previously estimated using its formulation as a Markov decision process (MDP) with dynamic programming (DP) algorithms. However, their computational complexity grows exponentially with the channel alphabet size. Therefore, we use RL, and specifically its ability to parameterize value functions and policies with neural networks, to evaluate numerically the feedback capacity of channels with a large alphabet size. The outcome of the RL algorithm is a numerical lower bound on the feedback capacity, which is used to reveal the structure of the optimal solution. The structure is modeled by a graph-based auxiliary random variable that is utilized to derive an analytic upper bound on the feedback capacity with the duality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · stochastic dynamics and bifurcation · Neural Networks and Applications
