Quadratic-Time Hardness of LCS and other Sequence Similarity Measures
Amir Abboud, Arturs Backurs, Virginia Vassilevska Williams

TL;DR
This paper proves that significantly faster algorithms for computing the Longest Common Subsequence and Dynamic Time Warping distance would refute the Strong Exponential Time Hypothesis, establishing their quadratic-time hardness.
Contribution
It establishes conditional lower bounds for LCS and DTWD computations, showing no subquadratic algorithms exist under SETH, and extends these results to multiple sequences.
Findings
No $O(n^{2- ext{epsilon}})$ algorithms for LCS or DTWD unless SETH fails.
Computing LCS of $k$ strings over size $O(k)$ cannot be done in $O(n^{k- ext{epsilon}})$ time under SETH.
Approximating DTWD in truly subquadratic time is also hard under SETH.
Abstract
Two important similarity measures between sequences are the longest common subsequence (LCS) and the dynamic time warping distance (DTWD). The computations of these measures for two given sequences are central tasks in a variety of applications. Simple dynamic programming algorithms solve these tasks in time, and despite an extensive amount of research, no algorithms with significantly better worst case upper bounds are known. In this paper, we show that an time algorithm, for some , for computing the LCS or the DTWD of two sequences of length over a constant size alphabet, refutes the popular Strong Exponential Time Hypothesis (SETH). Moreover, we show that computing the LCS of strings over an alphabet of size cannot be done in time, for any , under SETH. Finally, we also address the time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Error Correcting Code Techniques · Cellular Automata and Applications
