A regularity lemma and twins in words
Maria Axenovich, Yury Person, Svetlana Puzynina

TL;DR
This paper proves that any binary word of length n can be nearly evenly split into two disjoint identical subwords, using a regularity lemma for words, and extends the result to larger alphabets and multiple subwords.
Contribution
It introduces a regularity lemma for words and demonstrates that binary words can be almost perfectly partitioned into twins, advancing understanding of word structure and subword repetitions.
Findings
Binary words of length n contain nearly n/2 disjoint identical subwords.
The result extends to k identical subwords over alphabets with at most k letters.
The regularity lemma for words is a key tool in these proofs.
Abstract
For a word , let be the largest integer such that there are two disjoints identical (scattered) subwords of length . Let . Here, it is shown that \[2f(n, \{0,1\}) = n-o(n)\] using the regularity lemma for words. I.e., any binary word of length can be split into two identical subwords (referred to as twins) and, perhaps, a remaining subword of length . A similar result is proven for identical subwords of a word over an alphabet with at most letters.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Coding theory and cryptography · Finite Group Theory Research
