A New Finite-Horizon Dynamic Programming Analysis of Nonanticipative   Rate-Distortion Function for Markov Sources

Zixuan He; Charalambos D. Charalambous; Photios A. Stavrou

arXiv:2411.11698·cs.IT·November 19, 2024

A New Finite-Horizon Dynamic Programming Analysis of Nonanticipative Rate-Distortion Function for Markov Sources

Zixuan He, Charalambos D. Charalambous, Photios A. Stavrou

PDF

Open Access

TL;DR

This paper introduces a novel dynamic programming approach to compute the nonanticipative rate-distortion function for Markov sources, providing a practical algorithm with convergence guarantees for finite-horizon lossy compression.

Contribution

It develops a new information structure, convexity results, and a dynamic alternating minimization algorithm for the NRDF, enabling near-optimal solutions with provable convergence.

Findings

01

The proposed algorithms effectively approximate the NRDF for Markov sources.

02

The methodology is validated through simulations on binary Markov processes.

03

The approach provides a near-optimal solution with convergence guarantees.

Abstract

This paper deals with the computation of a non-asymptotic lower bound by means of the nonanticipative rate-distortion function (NRDF) on the discrete-time zero-delay variable-rate lossy compression problem for discrete Markov sources with per-stage, single-letter distortion. First, we derive a new information structure of the NRDF for Markov sources and single-letter distortions. Second, we derive new convexity results on the NRDF, which facilitate the use of Lagrange duality theorem to cast the problem as an unconstrained partially observable finite-time horizon stochastic dynamic programming (DP) algorithm subject to a probabilistic state (belief state) that summarizes the past information about the reproduction symbols and takes values in a continuous state space. Instead of approximating the DP algorithm directly, we use Karush-Kuhn-Tucker (KKT) conditions to find an implicit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics