Scalable Algorithms for Approximate DNF Model Counting

Paul Burkhardt; David G. Harris; Kevin T Schmitt

arXiv:2601.10511·cs.DS·January 16, 2026

Scalable Algorithms for Approximate DNF Model Counting

Paul Burkhardt, David G. Harris, Kevin T Schmitt

PDF

Open Access

TL;DR

This paper introduces a new scalable Monte Carlo algorithm for approximate DNF model counting that outperforms previous methods in efficiency and scalability, with proven PAC bounds and practical success on large problems.

Contribution

It develops an adaptive Monte Carlo algorithm with stopping rules and short-circuit evaluation, providing theoretical guarantees and superior empirical performance.

Findings

01

Achieves PAC learning bounds

02

Outperforms prior algorithms by orders of magnitude

03

Scales to problems with millions of variables

Abstract

Model counting of Disjunctive Normal Form (DNF) formulas is a critical problem in applications such as probabilistic inference and network reliability. For example, it is often used for query evaluation in probabilistic databases. Due to the computational intractability of exact DNF counting, there has been a line of research into a variety of approximation algorithms. These include Monte Carlo approaches such as the classical algorithms of Karp, Luby, and Madras (1989), as well as methods based on hashing (Soos et al. 2023), and heuristic approximations based on Neural Nets (Abboud, Ceylan, and Lukasiewicz 2020). We develop a new Monte Carlo approach with an adaptive stopping rule and short-circuit formula evaluation. We prove it achieves Probably Approximately Correct (PAC) learning bounds and is asymptotically more efficient than the previous methods. We also show experimentally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Data Quality and Management · Machine Learning and Algorithms