Elastic-Degenerate String Matching with 1 Error
Giulia Bernardini, Est\'eban Gabory, Solon P. Pissis, Leen Stougie,, Michelle Sweering, Wiktor Zuba

TL;DR
This paper presents improved algorithms for elastic-degenerate string matching with one error, achieving faster runtimes by reducing the problem to computational geometry and indexing techniques, especially for DNA sequence analysis.
Contribution
The paper introduces new algorithms for 1-error elastic-degenerate string matching with significantly improved time complexities, utilizing reductions to geometry problems and advanced indexing methods.
Findings
Achieves $ ilde{ ext{O}}(nm^2 + N)$ time for 1-error matching under edit distance.
Provides a faster decision algorithm with $ ext{O}(nm^2\sqrt{ ext{log} ext{m}} + N ext{log} ext{log} ext{m})$ complexity.
Extends techniques to Hamming distance with $ ext{O}(nm^2 + N ext{log} ext{m})$ time.
Abstract
An elastic-degenerate string is a sequence of finite sets of strings of total length , introduced to represent a set of related DNA sequences, also known as a pangenome. The ED string matching (EDSM) problem consists in reporting all occurrences of a pattern of length in an ED text. This problem has recently received some attention by the combinatorial pattern matching community, culminating in an -time algorithm [Bernardini et al., SIAM J. Comput. 2022], where denotes the matrix multiplication exponent and the notation suppresses polylog factors. In the -EDSM problem, the approximate version of EDSM, we are asked to report all pattern occurrences with at most errors. -EDSM can be solved in time, under edit distance, or time, under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Network Packet Processing and Optimization · Natural Language Processing Techniques
