AutoICE: Automatically Synthesizing Verifiable C Code via LLM-driven Evolution
Weilin Luo, Xueyi Liang, Haotian Deng, Yanan Liu, Hai Wan

TL;DR
AutoICE leverages large language models and evolutionary search techniques to automatically synthesize verifiable C code from natural language requirements, improving correctness and reliability.
Contribution
It introduces diverse initialization, collaborative crossover, and self-reflective mutation to enhance verifiable code synthesis, addressing errors and implicit knowledge formalization.
Findings
Achieves 90.36% verification success rate on standard datasets.
Outperforms state-of-the-art methods in code verification accuracy.
Attains 88.33% success rate on developer-friendly dataset.
Abstract
Automatically synthesizing verifiable code from natural language requirements ensures software correctness and reliability while significantly lowering the barrier to adopting the techniques of formal methods. With the rise of large language models (LLMs), long-standing efforts at autoformalization have gained new momentum. However, existing approaches suffer from severe syntactic and semantic errors due to the scarcity of domain-specific pre-training corpora and often fail to formalize implicit knowledge effectively. In this paper, we propose AutoICE, an LLM-driven evolutionary search for synthesizing verifiable C code. It introduces the diverse individual initialization and the collaborative crossover to enable diverse iterative updates, thereby mitigating error propagation inherent in single-agent iterations. Besides, it employs the self-reflective mutation to facilitate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Machine Learning and Data Classification · Topic Modeling
