Logarithmically larger deletion codes of all distances
Noga Alon, Gabriela Bourla, Ben Graham, Xiaoyu He, Noah Kravitz

TL;DR
This paper improves the asymptotic bounds on the size of deletion codes, showing they can be exponentially larger than previously known, with sizes scaling as at least _k(2^n \, log n / n^{2k}).
Contribution
The paper provides the first asymptotic improvement to classical bounds on deletion code sizes, demonstrating larger codes exist for all distances.
Findings
Existence of larger deletion codes with size _k(2^n \, log n / n^{2k})
Improved bounds inspired by Jiang and Vardy's work
Results on longest common subsequences and shortest common supersequences
Abstract
The deletion distance between two binary words is the smallest such that and share a common subsequence of length . A set of binary words of length is called a -deletion code if every pair of distinct words in has deletion distance greater than . In 1965, Levenshtein initiated the study of deletion codes by showing that, for fixed and going to infinity, a -deletion code of maximum size satisfies . We make the first asymptotic improvement to these bounds by showing that there exist -deletion codes with size at least . Our proof is inspired by Jiang and Vardy's improvement to the classical Gilbert--Varshamov bounds. We also establish several related results on the number of longest common subsequences and shortest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Advanced biosensing and bioanalysis techniques · Coding theory and cryptography
