FormalEvolve: Neuro-Symbolic Evolutionary Search for Diverse and Prover-Effective Autoformalization
Haijian Lu, Wei Wang, Jing Liu

TL;DR
FormalEvolve introduces a neuro-symbolic evolutionary approach to autoformalization, enhancing diversity and prover effectiveness by generating multiple formalizations within a limited computational budget, leading to higher semantic hit rates and improved proof success.
Contribution
It formulates autoformalization as a budgeted search problem and proposes a novel neuro-symbolic evolutionary framework combining LLM-driven mutations with symbolic AST rewrites.
Findings
Achieves 58.0% and 84.9% semantic hit rates on benchmarks.
Reduces concentration of successful formalizations across problems.
Improves downstream proof success within fixed prover budgets.
Abstract
Autoformalization aims to translate natural-language mathematics into compilable, machine-checkable statements. However, semantic consistency does not imply prover effectiveness: even semantically consistent formalizations can differ substantially in proof-search cost and success rate. In this work, we formulate autoformalization as a budgeted, test-time search for semantically consistent repertoires, and propose FormalEvolve, a compilation-gated neuro-symbolic evolutionary framework. FormalEvolve generates diverse candidates via LLM-driven mutation and crossover with bounded patch repair, while symbolic Abstract Syntax Tree (AST) rewrite operations further inject structural diversity. On CombiBench and ProofNet, under a strict generator-call budget of T = 100, FormalEvolve reaches semantic hit rates (SH@100) of 58.0% and 84.9%, and reduces cross-problem concentration of semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Mathematics, Computing, and Information Processing · Evolutionary Algorithms and Applications
