Computing Runs on a General Alphabet
Dmitry Kosolobov

TL;DR
This paper presents a RAM algorithm for finding all maximal repetitions in a string over a general alphabet with sub-quadratic time complexity, outperforming previous solutions for large alphabets.
Contribution
It introduces a new algorithm that computes all runs in $O(n ext{log}^{2/3} n)$ time, improving over existing methods for large alphabet sizes.
Findings
Algorithm runs in $O(n ext{log}^{2/3} n)$ time
Outperforms previous solutions for $\sigma = n^{ ext{Omega}(1)}$
Conjecture of a possible linear time algorithm for all runs
Abstract
We describe a RAM algorithm computing all runs (maximal repetitions) of a given string of length over a general ordered alphabet in time and linear space. Our algorithm outperforms all known solutions working in time provided , where is the alphabet size. We conjecture that there exists a linear time RAM algorithm finding all runs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · semigroups and automata theory · DNA and Biological Computing
