Balancing Straight-Line Programs
Moses Ganardi, Artur Je\.z, Markus Lohrey

TL;DR
This paper demonstrates that context-free grammars and other grammar-based compression formalisms can be efficiently balanced to have logarithmic depth while maintaining linear size, solving a key open problem in compression theory.
Contribution
It introduces a universal method to balance grammar-based representations, reducing their depth to logarithmic scale with minimal size increase, applicable to multiple formalism types.
Findings
Balanced grammars have depth O(log |w|)
Linear time transformation to balanced grammars
Applicable to string and tree compression formalisms
Abstract
It is shown that a context-free grammar of size that produces a single string (such a grammar is also called a string straight-line program) can be transformed in linear time into a context-free grammar for of size , whose unique derivation tree has depth . This solves an open problem in the area of grammar-based compression. Similar results are shown for two formalism for grammar-based tree compression: top dags and forest straight-line programs. These balancing results are all deduced from a single meta theorem stating that the depth of an algebraic circuit over an algebra with a certain finite base property can be reduced to with the cost of a constant multiplicative size increase. Here, refers to the size of the unfolding (or unravelling) of the circuit.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
