Small space and streaming pattern matching with k edits
Tomasz Kociumaka, Ely Porat, Tatiana Starikovskaya

TL;DR
This paper introduces a streaming algorithm for approximate pattern matching under edit distance with space and time complexities polynomial in k, solving a longstanding open problem and providing new sketching techniques.
Contribution
It presents the first poly(k log n)-space streaming algorithm for edit distance pattern matching, along with a novel edit distance sketch with improved size and computation.
Findings
Achieves $\tilde{O}(k^5)$ space and $\tilde{O}(k^8)$ amortised time per character in the streaming model.
Introduces a new edit distance sketch of size $\tilde{O}(k^2)$ with efficient computation.
Provides a deterministic semi-streaming algorithm with better complexity.
Abstract
In this work, we revisit the fundamental and well-studied problem of approximate pattern matching under edit distance. Given an integer $k$, a pattern $P$ of length $m$, and a text $T$ of length $n \ge m$, the task is to find substrings of $T$ that are within edit distance $k$ from $P$. Our main result is a streaming algorithm that solves the problem in $\tilde{O}(k^5)$ space and $\tilde{O}(k^8)$ amortised time per character of the text, providing answers correct with high probability. (Hereafter, $\tilde{O}(\cdot)$ hides a $\mathrm{poly}(\log n)$ factor.) This answers a decade-old question: since the discovery of a $\mathrm{poly}(k \log n)$-space streaming algorithm for pattern matching under Hamming distance by Porat and Porat [FOCS 2009], the existence of an analogous result for edit distance remained open. Up to this work, no $\mathrm{poly}(k \log n)$-space algorithm was known even in…
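To make the problem statement concrete, here is a minimal sketch of the classic column-by-column dynamic program for pattern matching with k edits. It is not the paper's streaming algorithm: it keeps the whole pattern in memory and uses space linear in the pattern length, which is exactly the cost the paper's result avoids. The function name `k_edit_occurrences` and its parameters are hypothetical, chosen only for illustration.

```python
def k_edit_occurrences(pattern, text, k):
    """Report every end position j (1-based) such that some substring of
    text ending at j is within edit distance k of pattern.

    Baseline only: O(len(pattern)) space and O(len(pattern)) time per
    text character. This is NOT the small-space algorithm of the paper.
    """
    m = len(pattern)
    # col[i] = min edit distance between pattern[:i] and any substring
    # of the text ending at the current position (initially position 0).
    col = list(range(m + 1))
    ends = []
    for j, c in enumerate(text, start=1):
        prev_diag = col[0]  # value for (i-1, j-1)
        col[0] = 0          # an occurrence may start anywhere in the text
        for i in range(1, m + 1):
            old = col[i]    # value for (i, j-1)
            cost = 0 if pattern[i - 1] == c else 1
            col[i] = min(prev_diag + cost,  # match or substitution
                         old + 1,           # extra text character
                         col[i - 1] + 1)    # missing pattern character
            prev_diag = old
        if col[m] <= k:
            ends.append(j)
    return ends


if __name__ == "__main__":
    print(k_edit_occurrences("abc", "xxabcxx", 0))
    print(k_edit_occurrences("abc", "abd", 1))
```

The text is consumed one character at a time, so the loop already has a streaming shape; the obstacle is the `col` array and the stored pattern, both of which grow with the pattern length rather than with k.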
