A New Finite-Horizon Dynamic Programming Analysis of Nonanticipative Rate-Distortion Function for Markov Sources
Zixuan He, Charalambos D. Charalambous, Photios A. Stavrou

TL;DR
This paper introduces a novel dynamic programming approach to compute the nonanticipative rate-distortion function for Markov sources, providing a practical algorithm with convergence guarantees for finite-horizon lossy compression.
Contribution
It develops a new information structure, convexity results, and a dynamic alternating minimization algorithm for the NRDF, enabling near-optimal solutions with provable convergence.
Findings
The proposed algorithms effectively approximate the NRDF for Markov sources.
The methodology is validated through simulations on binary Markov processes.
The approach provides a near-optimal solution with convergence guarantees.
Abstract
This paper deals with the computation of a non-asymptotic lower bound by means of the nonanticipative rate-distortion function (NRDF) on the discrete-time zero-delay variable-rate lossy compression problem for discrete Markov sources with per-stage, single-letter distortion. First, we derive a new information structure of the NRDF for Markov sources and single-letter distortions. Second, we derive new convexity results on the NRDF, which facilitate the use of Lagrange duality theorem to cast the problem as an unconstrained partially observable finite-time horizon stochastic dynamic programming (DP) algorithm subject to a probabilistic state (belief state) that summarizes the past information about the reproduction symbols and takes values in a continuous state space. Instead of approximating the DP algorithm directly, we use Karush-Kuhn-Tucker (KKT) conditions to find an implicit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics
