On Maximal Unbordered Factors

Gregory Kucherov; Alexander Loptev; Tatiana Starikovskaya

arXiv:1504.07406·cs.DS·April 29, 2015

On Maximal Unbordered Factors

Gregory Kucherov, Alexander Loptev, Tatiana Starikovskaya

PDF

Open Access

TL;DR

This paper studies the properties of the longest unbordered substring in a string, proving it is nearly as long as the string itself for large strings over alphabets of size five or more, and introduces a new algorithm for finding it.

Contribution

It establishes a high expected length of the maximal unbordered factor for large strings over certain alphabets and proposes a new efficient algorithm for its computation.

Findings

01

Expected maximal unbordered factor length is at least 0.99 n for large n and alphabet size ≥ 5.

02

The result applies to strings over alphabets of size five or more.

03

A new algorithm for computing the maximal unbordered factor is proposed.

Abstract

Given a string $S$ of length $n$ , its maximal unbordered factor is the longest factor which does not have a border. In this work we investigate the relationship between $n$ and the length of the maximal unbordered factor of $S$ . We prove that for the alphabet of size $σ \geq 5$ the expected length of the maximal unbordered factor of a string of length~ $n$ is at least $0.99 n$ (for sufficiently large values of $n$ ). As an application of this result, we propose a new algorithm for computing the maximal unbordered factor of a string.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · semigroups and automata theory · DNA and Biological Computing