Efficiently Enumerating Hitting Sets of Hypergraphs Arising in Data Profiling
Thomas Bl\"asius, Tobias Friedrich, Julius Lischeid, Kitty Meeks, and, Martin Schirneck

TL;DR
This paper presents a new polynomial-delay and polynomial-space algorithm for enumerating minimal hitting sets in hypergraphs, with applications to data profiling, improving upon previous methods and providing theoretical and empirical insights.
Contribution
It introduces a hypergraph enumeration algorithm with delay and space bounds independent of the solution size, and analyzes the complexity of the extension problem for minimal hitting sets.
Findings
Algorithm achieves polynomial delay and space for hypergraph enumeration.
Extension problem for minimal hitting sets is W[3]-complete and nearly optimally solvable.
Empirical results show improved performance on real-world data profiling tasks.
Abstract
The transversal hypergraph problem is the task of enumerating the minimal hitting sets of a hypergraph. It is a long-standing open question whether this can be done in output-polynomial time. For hypergraphs whose solutions have bounded size, Eiter and Gottlob [SICOMP 1995] gave an algorithm that runs in output-polynomial time, but whose space requirement also scales with the output size. We improve this to polynomial delay and polynomial space. More generally, we present an algorithm that on -vertex, -edge hypergraphs has delay and uses space, where is the maximum size of any minimal hitting set. Our algorithm is oblivious to , a quantity that is hard to compute or even approximate. Central to our approach is the extension problem for minimal hitting sets, deciding for a set of vertices whether it is contained in any solution. With…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
