On Maximal Unbordered Factors
Gregory Kucherov, Alexander Loptev, Tatiana Starikovskaya

TL;DR
This paper studies the properties of the longest unbordered substring in a string, proving it is nearly as long as the string itself for large strings over alphabets of size five or more, and introduces a new algorithm for finding it.
Contribution
It establishes a high expected length of the maximal unbordered factor for large strings over certain alphabets and proposes a new efficient algorithm for its computation.
Findings
Expected maximal unbordered factor length is at least 0.99 n for large n and alphabet size ≥ 5.
The result applies to strings over alphabets of size five or more.
A new algorithm for computing the maximal unbordered factor is proposed.
Abstract
Given a string of length , its maximal unbordered factor is the longest factor which does not have a border. In this work we investigate the relationship between and the length of the maximal unbordered factor of . We prove that for the alphabet of size the expected length of the maximal unbordered factor of a string of length~ is at least (for sufficiently large values of ). As an application of this result, we propose a new algorithm for computing the maximal unbordered factor of a string.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · DNA and Biological Computing
