UNSAT Solver Synthesis via Monte Carlo Forest Search
Chris Cameron, Jason Hartford, Taylor Lundy, Tuan Truong, Alan, Milligan, Rex Chen, Kevin Leyton-Brown

TL;DR
This paper introduces Monte Carlo Forest Search (MCFS), a reinforcement learning method for efficiently solving complex tree-structured problems like SAT unsatisfiability, by learning policies that minimize search tree size.
Contribution
It presents the first RL approach to efficiently learn DPLL branching policies for SAT, avoiding exponential costs through sampling and early decision focus.
Findings
Matched or exceeded strong baseline performance
Handled problems two orders of magnitude more challenging
Introduced a novel RL algorithm for tree MDPs
Abstract
We introduce Monte Carlo Forest Search (MCFS), a class of reinforcement learning (RL) algorithms for learning policies in {tree MDPs}, for which policy execution involves traversing an exponential-sized tree. Examples of such problems include proving unsatisfiability of a SAT formula; counting the number of solutions of a satisfiable SAT formula; and finding the optimal solution to a mixed-integer program. MCFS algorithms can be seen as extensions of Monte Carlo Tree Search (MCTS) to cases where, rather than finding a good path (solution) within a tree, the problem is to find a small tree within a forest of candidate trees. We instantiate and evaluate our ideas in an algorithm that we dub Knuth Synthesis, an MCFS algorithm that learns DPLL branching policies for solving the Boolean satisfiability (SAT) problem, with the objective of achieving good average-case performance on a given…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
