RePair Grammars are the Smallest Grammars for Fibonacci Words
Takuya Mieno, Shunsuke Inenaga, Takashi Horiyama

TL;DR
This paper proves that RePair grammars are the smallest possible for Fibonacci words, establishing RePair's optimality for this infinite family of strings, which was previously unknown.
Contribution
It characterizes RePair as producing the smallest grammars for Fibonacci words, demonstrating its optimality for this non-trivial infinite family.
Findings
RePair generates smallest grammars for Fibonacci words
RePair's optimality extends to an infinite family of strings
No other grammars are smaller than RePair for Fibonacci words
Abstract
Grammar-based compression is a loss-less data compression scheme that represents a given string by a context-free grammar that generates only . While computing the smallest grammar which generates a given string is NP-hard in general, a number of polynomial-time grammar-based compressors which work well in practice have been proposed. RePair, proposed by Larsson and Moffat in 1999, is a grammar-based compressor which recursively replaces all possible occurrences of a most frequently occurring bigrams in the string. Since there can be multiple choices of the most frequent bigrams to replace, different implementations of RePair can result in different grammars. In this paper, we show that the smallest grammars generating the Fibonacci words can be completely characterized by RePair, where denotes the -th Fibonacci word. Namely, all grammars for generated by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
