Faster Algorithms for Longest Common Substring
Panagiotis Charalampopoulos, Tomasz Kociumaka, Jakub Radoszewski, Solon P. Pissis

TL;DR
This paper introduces faster algorithms for the longest common substring problem and its k-mismatch variant, achieving sublinear time in certain models and breaking previous complexity barriers.
Contribution
It presents novel algorithms that improve the time complexity for LCS and k-mismatch LCS problems, surpassing prior known bounds and utilizing optimal space.
Findings
LCS computed in $O(n rac{ ext{log} \sigma}{ ext{sqrt}( ext{log} n)})$ time for small alphabets.
k-mismatch LCS computed in $O(n ext{log}^{k-1/2} n)$ time, breaking the $n ext{log}^k n$ barrier.
Algorithms use optimal $O(n rac{ ext{log} \sigma}{ ext{log} n})$ space.
Abstract
In the classic longest common substring (LCS) problem, we are given two strings and , each of length at most , over an alphabet of size , and we are asked to find a longest string occurring as a fragment of both and . Weiner, in his seminal paper that introduced the suffix tree, presented an -time algorithm for this problem [SWAT 1973]. For polynomially-bounded integer alphabets, the linear-time construction of suffix trees by Farach yielded an -time algorithm for the LCS problem [FOCS 1997]. However, for small alphabets, this is not necessarily optimal for the LCS problem in the word RAM model of computation, in which the strings can be stored in space and read in time. We show that, in this model, we can compute an LCS in time , which is sublinear in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
