A Proof of Entropy Minimization for Outputs in Deletion Channels via Hidden Word Statistics
Arash Atashpendar, David Mestel, A. W. Roscoe, Peter Y. A. Ryan

TL;DR
This paper proves that, in the limit of large input length, the output of a deletion channel minimizes the posterior entropy for certain constant and alternating strings, using hidden word statistics and combinatorial methods.
Contribution
It confirms the conjecture that specific string patterns minimize the posterior entropy in deletion channels, applying analytic combinatorics and hidden word statistics techniques.
Findings
Minimization of posterior entropy for constant and alternating strings in the asymptotic limit.
Application of hidden word statistics to analyze deletion channel outputs.
Establishment of entropy minimization via autocorrelation measures.
Abstract
From the output produced by a memoryless deletion channel from a uniformly random input of known length , one obtains a posterior distribution on the channel input. The difference between the Shannon entropy of this distribution and that of the uniform prior measures the amount of information about the channel input which is conveyed by the output of length , and it is natural to ask for which outputs this is extremized. This question was posed in a previous work, where it was conjectured on the basis of experimental data that the entropy of the posterior is minimized and maximized by the constant strings and and the alternating strings and respectively. In the present work we confirm the minimization conjecture in the asymptotic limit using results from hidden word statistics. We show how the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Algorithms and Data Compression · Cellular Automata and Applications
