Faster Algorithms for Longest Common Substring

Panagiotis Charalampopoulos; Tomasz Kociumaka; Jakub Radoszewski; Solon P. Pissis

arXiv:2105.03106·cs.DS·November 18, 2025

TL;DR

This paper gives faster algorithms for the longest common substring (LCS) problem and its k-mismatch variant. In the word RAM model, the LCS algorithm runs in time sublinear in $n$ for small alphabets, and the k-mismatch algorithm breaks the previous $n \log^k n$ barrier.

Contribution

The paper presents algorithms that improve on the previously known time bounds for the LCS and k-mismatch LCS problems while using optimal space.

Findings

01

LCS computed in $O(n \log \sigma / \sqrt{\log n})$ time for small alphabets.

02

k-mismatch LCS computed in $O(n \log^{k-1/2} n)$ time, breaking the $n \log^k n$ barrier.

03

Algorithms use optimal $O(n \log \sigma / \log n)$ space.
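To make the k-mismatch variant concrete, here is a minimal Python sketch of a naive quadratic-time baseline. It assumes the standard definition of the problem (the longest pair of equal-length fragments of $S$ and $T$ whose Hamming distance is at most $k$), which this summary does not spell out. It is not the paper's algorithm, and the function name `k_mismatch_lcs_length` is made up for illustration.

```python
def k_mismatch_lcs_length(s: str, t: str, k: int) -> int:
    """Length of the longest pair of equal-length fragments of s and t
    that differ in at most k positions (naive O(|s| * |t|) baseline,
    NOT the algorithm from the paper)."""
    n, m = len(s), len(t)
    best = 0
    # Each alignment ("diagonal") fixes the offset shift = i - j.
    for shift in range(-(m - 1), n):
        i = max(shift, 0)          # start position in s
        j = i - shift              # start position in t
        length = min(n - i, m - j)
        left = 0                   # left end of the sliding window
        mismatches = 0
        for right in range(length):
            if s[i + right] != t[j + right]:
                mismatches += 1
            # Shrink the window until it holds at most k mismatches.
            while mismatches > k:
                if s[i + left] != t[j + left]:
                    mismatches -= 1
                left += 1
            best = max(best, right - left + 1)
    return best


print(k_mismatch_lcs_length("abcde", "abxde", 1))
print(k_mismatch_lcs_length("abcde", "abxde", 0))
```

With $k = 0$ the sketch reduces to the plain LCS length. The bounds listed above improve on such baselines by a wide margin.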

Abstract

In the classic longest common substring (LCS) problem, we are given two strings $S$ and $T$, each of length at most $n$, over an alphabet of size $\sigma$, and we are asked to find a longest string occurring as a fragment of both $S$ and $T$. Weiner, in his seminal paper that introduced the suffix tree, presented an $O(n \log \sigma)$-time algorithm for this problem [SWAT 1973]. For polynomially-bounded integer alphabets, the linear-time construction of suffix trees by Farach yielded an $O(n)$-time algorithm for the LCS problem [FOCS 1997]. However, for small alphabets, this is not necessarily optimal for the LCS problem in the word RAM model of computation, in which the strings can be stored in $O(n \log \sigma / \log n)$ space and read in $O(n \log \sigma / \log n)$ time. We show that, in this model, we can compute an LCS in time $O(n \log \sigma / \sqrt{\log n})$, which is sublinear in…
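As a point of reference for the problem statement and the packed representation mentioned in the abstract, the following Python sketch shows a textbook quadratic-time dynamic program for LCS and a rough count of the machine words needed to store a packed string. Neither function comes from the paper. The names `longest_common_substring` and `packed_word_count`, and the default word size of 64 bits, are assumptions chosen for illustration.

```python
def longest_common_substring(s: str, t: str) -> str:
    """Textbook O(|s| * |t|) dynamic program (NOT the paper's algorithm).

    cur[j] holds the length of the longest common suffix of
    s[:i] and t[:j]; the maximum over all (i, j) is the LCS length.
    """
    best_len, best_end = 0, 0
    prev = [0] * (len(t) + 1)
    for i in range(1, len(s) + 1):
        cur = [0] * (len(t) + 1)
        for j in range(1, len(t) + 1):
            if s[i - 1] == t[j - 1]:
                cur[j] = prev[j - 1] + 1
                if cur[j] > best_len:
                    best_len, best_end = cur[j], i
        prev = cur
    return s[best_end - best_len:best_end]


def packed_word_count(n: int, sigma: int, word_bits: int = 64) -> int:
    """Words needed to store n characters over an alphabet of size sigma
    when each character takes ceil(log2(sigma)) bits (at least 1).
    Assumes a character fits in one word."""
    bits_per_char = max(1, (sigma - 1).bit_length())
    chars_per_word = word_bits // bits_per_char
    return -(-n // chars_per_word)  # ceiling division


print(longest_common_substring("xabcdy", "zzabcdw"))
print(packed_word_count(1000, 4))
```

In the word RAM model the word size is $\Theta(\log n)$ bits, so the count returned by `packed_word_count` corresponds to the $O(n \log \sigma / \log n)$ space bound quoted in the abstract.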
