Micro-Diffusion Compression - Binary Tree Tweedie Denoising for Online Probability Estimation
Roberto Tacconelli

TL;DR
Midicoth introduces a novel micro-diffusion denoising layer that enhances probability estimates in lossless compression by decomposing predictions into binary decisions, enabling efficient calibration and improved compression performance.
Contribution
The paper proposes a new binary tree-based denoising method for online probability calibration, improving compression efficiency in adaptive models like PPM.
Findings
Improved probability calibration leads to better compression ratios.
Binary decomposition enables reliable correction with limited data.
The method is fully online and integrates with existing models.
Abstract
We present Midicoth, a lossless compression system that introduces a micro-diffusion denoising layer for improving probability estimates produced by adaptive statistical models. In compressors such as Prediction by Partial Matching (PPM), probability estimates are smoothed by a prior to handle sparse observations. When contexts have been seen only a few times, this prior dominates the prediction and produces distributions that are significantly flatter than the true source distribution, leading to compression inefficiency. Midicoth addresses this limitation by treating prior smoothing as a shrinkage process and applying a reverse denoising step that corrects predicted probabilities using empirical calibration statistics. To make this correction data-efficient, the method decomposes each byte prediction into a hierarchy of binary decisions along a bitwise tree. This converts a single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Algorithms and Data Compression · Speech Recognition and Synthesis
