Synchronizing Probabilities in Model-Driven Lossless Compression

Aviv Adler; Jennifer Tang

arXiv:2601.10678·cs.IT·March 10, 2026

Synchronizing Probabilities in Model-Driven Lossless Compression

Aviv Adler, Jennifer Tang

PDF

Open Access 3 Reviews

TL;DR

This paper introduces PMATIC, a novel probabilistic coding method that tolerates prediction mismatches in model-driven lossless compression, ensuring robustness and improved compression rates with theoretical guarantees and empirical validation.

Contribution

The paper formalizes prediction mismatch issues in model-driven compression and proposes PMATIC, a model-agnostic algorithm that maintains correctness and efficiency despite bounded prediction errors.

Findings

01

PMATIC is robust to prediction mismatch in compression.

02

PMATIC outperforms standard compression tools on text data.

03

Theoretical bounds confirm PMATIC's effectiveness.

Abstract

It is well-known in the field of lossless data compression that probabilistic next-symbol prediction can be used to compress sequences of symbols. Deep neural networks are able to capture rich dependencies in data, offering a powerful means of estimating these probabilities and hence an avenue towards more effective compression algorithms. However, both compressor and decompressor must have exactly matching predictions; even small differences from non-determinism (which often happen with learned models due to hardware, software, or computation order) can lead to cascading decoding failures. In this paper, we formalize the problem of prediction mismatch in model-driven compression, and introduce Probability Matching Interval Coding (PMATIC), a model-agnostic algorithm that tolerates bounded prediction mismatch with low overhead. PMATIC works with the predicted probabilities, making it…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 8Confidence 4

Strengths

1. This is a very relevant problem, I believe this would allow the community to actually make practical compressors with the proposed algorithm. 2. Article is original and a novel algorithm is proposed. Paper is quite easy to read.

Weaknesses

1. The only weakness I would say is some analysis on what delta's would we expect by changing hardware or going from GPUs to CPUs. Also there is relevant research addressing uncertainty in LLM prediction which should be added to related work (https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/). 2. Please add comparison with some more compressors, especially CMIX. Also include latency because that is where traditional compressors win by a huge margin.

Reviewer 02Rating 6Confidence 4

Strengths

- The method is reasonably simple to apply to existing LLMs, without needing to retrain. - The authors provide some guarantees on the added bit length due to binning. - This is a very important, and realistic problem, that needs to be solved to have next-gen AI codecs, and I appreciate the authors pushing on real world problems.

Weaknesses

- The experiments are significantly lacking in breadth. If the method is general, the authors could provide further experiments with different data modalities. - The following is hard, but would significantly improve the paper: can the authors estimate what are typical deviations present in relevant scenarios where AI codecs could be applied? For example, take any open source model, and apply the encoder and decoder using 1) a different version of CUDA, 2) different models, and other variables

Reviewer 03Rating 8Confidence 3

Strengths

Originality: This paper is the first to mathematically formalize the problem of "probabilistic model-driven compression" $\times$ "LLM nondeterminism". Quality: Not only is there a mathematical discussion of performance analysis, but the usefulness of the proposed method is also verified through numerical experiments, making the paper of high quality. Clarity: The problem to be addressed is clearly stated, the algorithm is given in detail, and the argument is clear enough. Significance: This pa

Weaknesses

As mentioned in 6. Future work, if LLM is actually used for lossless compression, the size of the models required to implement the compressor and decoder will be a major problem. This issue has not been discussed in this paper.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Advanced Data Compression Techniques · Speech Recognition and Synthesis