Edit distance in substitution systems
Andrew Best, Yuval Peres

TL;DR
This paper investigates the edit distance among words generated by primitive substitutions, improving bounds on their diameter and analyzing specific cases like the Thue–Morse substitution.
Contribution
It provides an improved upper bound of the edit distance diameter for substitution words and analyzes the specific case of the Thue–Morse substitution.
Findings
Upper bound on edit distance diameter improved to O(n/√log n)
Diameter for Thue–Morse words is at least √(n/6) - 1
Handling non-uniform substitutions presents main challenge
Abstract
Let be a primitive substitution on an alphabet , and let be the set of words of length determined by (i.e., if is a subword of for some and ). It is known that the corresponding substitution dynamical system is loosely Kronecker (also known as zero-entropy loosely Bernoulli), so the diameter of in the edit distance is . We improve this upper bound to . The main challenge is handling the case where is non-uniform; a better bound is available for the uniform case. Finally, we show that for the Thue--Morse substitution, the diameter of is at least .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · semigroups and automata theory · Algorithms and Data Compression
