Smallest Suffixient Sets: Effectiveness, Resilience, and Calculation
Hiroto Fujimaru, Gonzalo Navarro, Giuseppe Romana, Cristian Urbina

TL;DR
This paper introduces the suffixient set as a new measure of string repetitiveness, analyzes its properties under various string operations, and compares it with existing measures, providing algorithms and bounds.
Contribution
It defines the smallest suffixient set, studies its sensitivity to string modifications, and relates it to known measures like runs and lexicographic parsing.
Findings
the size of the smallest suffixient set cannot increase by more than 2 after appending or prepending a character.
the ratio of suffixient sets of a string and its reverse is at most 2.
bounds for the increase of suffixient set size after edit operations and rotations.
Abstract
A suffixient set is a novel combinatorial object that captures the essential information of repetitive strings in a way that, provided with a random access mechanism, supports various forms of pattern matching. In this paper, we study the size of the smallest suffixient set as a repetitiveness measure. First, we study its sensitivity to various string operations. We show that cannot increase by more than 2 after appending or prepending a character to the string. As a consequence, we are able to give simple linear-time online algorithms to compute smallest suffixient sets. We also show that, although reversing the string can increase by an arbitrary value, it always holds . We also prove lower and upper bounds for the additive or multiplicative increase of after applying arbitrary edit operations, or rotating the text. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
