Reinforcement Learning Evaluation and Solution for the Feedback Capacity   of the Ising Channel with Large Alphabet

Ziv Aharoni; Oron Sabag; Haim Henri Permuter

arXiv:2008.07983·cs.IT·August 19, 2020·5 cites

Reinforcement Learning Evaluation and Solution for the Feedback Capacity of the Ising Channel with Large Alphabet

Ziv Aharoni, Oron Sabag, Haim Henri Permuter

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach to evaluate and approximate the feedback capacity of the Ising channel with large alphabets, overcoming computational challenges of traditional methods.

Contribution

It develops a neural network-based RL method to estimate feedback capacity for channels with large alphabets and derives analytic bounds and coding schemes from the structure of solutions.

Findings

01

RL provides a numerical lower bound on feedback capacity.

02

Analytic solutions are derived for alphabet sizes up to 8.

03

Asymptotic coding schemes are proposed for large alphabets.

Abstract

We propose a new method to compute the feedback capacity of unifilar finite state channels (FSCs) with memory using reinforcement learning (RL). The feedback capacity was previously estimated using its formulation as a Markov decision process (MDP) with dynamic programming (DP) algorithms. However, their computational complexity grows exponentially with the channel alphabet size. Therefore, we use RL, and specifically its ability to parameterize value functions and policies with neural networks, to evaluate numerically the feedback capacity of channels with a large alphabet size. The outcome of the RL algorithm is a numerical lower bound on the feedback capacity, which is used to reveal the structure of the optimal solution. The structure is modeled by a graph-based auxiliary random variable that is utilized to derive an analytic upper bound on the feedback capacity with the duality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · stochastic dynamics and bifurcation · Neural Networks and Applications