HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline
Yang Li, Xinyan Chen, Wenxuan Guo, Xijun Li, Wanqian Luo, Junhua, Huang, Hui-Ling Zhen, Mingxuan Yuan, Junchi Yan

TL;DR
HardSATGEN is a novel SAT formula generation method that better captures industrial formula structures and hardness, significantly improving the quality of generated instances for solver tuning.
Contribution
We introduce HardSATGEN, a fine-grained control mechanism for neural split-merge in SAT generation, addressing previous limitations in reproducing hardness and structural properties.
Findings
HardSATGEN outperforms previous methods in structural similarity and hardness preservation.
Generated instances significantly improve solver tuning effectiveness.
Experiments on private and corporate benchmarks validate the approach.
Abstract
Industrial SAT formula generation is a critical yet challenging task. Existing SAT generation approaches can hardly simultaneously capture the global structural properties and maintain plausible computational hardness. We first present an in-depth analysis for the limitation of previous learning methods in reproducing the computational hardness of original instances, which may stem from the inherent homogeneity in their adopted split-merge procedure. On top of the observations that industrial formulae exhibit clear community structure and oversplit substructures lead to the difficulty in semantic formation of logical structures, we propose HardSATGEN, which introduces a fine-grained control mechanism to the neural split-merge paradigm for SAT formula generation to better recover the structural and computational properties of the industrial benchmarks. Experiments including evaluations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Natural Language Processing Techniques
