String Attractors: Verification and Optimization
Dominik Kempa, Alberto Policriti, Nicola Prezza, Eva Rotenberg

TL;DR
This paper advances the theory of string attractors by providing efficient algorithms for verification, minimality, and approximation, leveraging new reductions based on the truncated suffix tree, and explores the complexity of related problems.
Contribution
It introduces a novel reduction using truncated suffix trees, enabling faster algorithms for verifying and approximating string attractors, and analyzes the complexity of the sharp-$k$-attractor problem.
Findings
Minimum 3-attractor can be found in O(n) time under certain conditions.
A 2.45-approximation for 3-attractor is achievable in O(n) time.
Sharp-$k$-attractor problem is NP-complete for k ≥ 3.
Abstract
String attractors [STOC 2018] are combinatorial objects recently introduced to unify all known dictionary compression techniques in a single theory. A set is a -attractor for a string if and only if every distinct substring of of length at most has an occurrence straddling at least one of the positions in . Finding the smallest -attractor is NP-hard for , but polylogarithmic approximations can be found using reductions from dictionary compressors. It is easy to reduce the -attractor problem to a set-cover instance where string's positions are interpreted as sets of substrings. The main result of this paper is a much more powerful reduction based on the truncated suffix tree. Our new characterization of the problem leads to more efficient algorithms for string attractors: we show how to check the validity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
