Rooting Out Entropy: Optimal Tree Extraction for Ultra-Succinct Graphs
Ziad Ismaili Alaoui, Tamio-Vesa Nakajima, Namrata, Sebastian Wild

TL;DR
This paper introduces a novel optimization problem for graph compression called MINETREX, analyzes its computational hardness, and proposes a greedy algorithm that enables ultra-succinct graph representations with practical query support.
Contribution
It formulates the MINETREX problem for entropy-based graph compression, proves its NP-hardness, and provides a simple greedy approximation algorithm with theoretical guarantees.
Findings
MINETREX is NP-hard to approximate within a certain additive error.
The greedy algorithm achieves an additive error of at most n / ln 2.
The resulting data structure enables space-efficient graph storage with logarithmic query time.
Abstract
We combine two methods for the lossless compression of unlabeled graphs - entropy compressing adjacency lists and computing canonical names for vertices - and solve an ensuing novel optimisation problem: Minimum-Entropy Tree-Extraction (MINETREX). MINETREX asks to determine a spanning forest to remove from a graph so that the remaining graph has minimal indegree entropy among all choices for . (Here is the indegree of vertex in ; is the number of edges.) We show that MINETREX is NP-hard to approximate with additive error better than (for some constant ), and provide a simple greedy algorithm that achieves additive error at most . By storing the extracted spanning forest and the remaining edges separately, we obtain a degree-entropy compressed ("ultrasuccinct") data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Algorithms and Data Compression · Complexity and Algorithms in Graphs
