UNSAT Solver Synthesis via Monte Carlo Forest Search

Chris Cameron; Jason Hartford; Taylor Lundy; Tuan Truong; Alan; Milligan; Rex Chen; Kevin Leyton-Brown

arXiv:2211.12581·cs.AI·July 16, 2024

UNSAT Solver Synthesis via Monte Carlo Forest Search

Chris Cameron, Jason Hartford, Taylor Lundy, Tuan Truong, Alan, Milligan, Rex Chen, Kevin Leyton-Brown

PDF

Open Access 1 Repo

TL;DR

This paper introduces Monte Carlo Forest Search (MCFS), a reinforcement learning method for efficiently solving complex tree-structured problems like SAT unsatisfiability, by learning policies that minimize search tree size.

Contribution

It presents the first RL approach to efficiently learn DPLL branching policies for SAT, avoiding exponential costs through sampling and early decision focus.

Findings

01

Matched or exceeded strong baseline performance

02

Handled problems two orders of magnitude more challenging

03

Introduced a novel RL algorithm for tree MDPs

Abstract

We introduce Monte Carlo Forest Search (MCFS), a class of reinforcement learning (RL) algorithms for learning policies in {tree MDPs}, for which policy execution involves traversing an exponential-sized tree. Examples of such problems include proving unsatisfiability of a SAT formula; counting the number of solutions of a satisfiable SAT formula; and finding the optimal solution to a mixed-integer program. MCFS algorithms can be seen as extensions of Monte Carlo Tree Search (MCTS) to cases where, rather than finding a good path (solution) within a tree, the problem is to find a small tree within a forest of candidate trees. We instantiate and evaluate our ideas in an algorithm that we dub Knuth Synthesis, an MCFS algorithm that learns DPLL branching policies for solving the Boolean satisfiability (SAT) problem, with the objective of achieving good average-case performance on a given…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chriscameron1/mcfs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics