TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback

Prithwish Jana; Sam Davidson; Bhavana Bhasker; Andrey Kan; Anoop Deoras; Laurent Callot

arXiv:2601.08734·cs.SE·January 14, 2026

TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback

Prithwish Jana, Sam Davidson, Bhavana Bhasker, Andrey Kan, Anoop Deoras, Laurent Callot

PDF

Open Access

TL;DR

TerraFormer is a neuro-symbolic framework that enhances large language models for automated Infrastructure-as-Code generation by integrating formal verification feedback, resulting in significantly improved correctness and compliance.

Contribution

The paper introduces TerraFormer, a novel fine-tuning approach combining supervised learning with verifier-guided reinforcement learning for IaC generation.

Findings

01

Improves correctness of LLM-generated IaC by up to 19.60%.

02

Outperforms larger models on key benchmarks.

03

Achieves top rankings in best-practices and security compliance.

Abstract

Automating Infrastructure-as-Code (IaC) is challenging, and large language models (LLMs) often produce incorrect configurations from natural language (NL). We present TerraFormer, a neuro-symbolic framework for IaC generation and mutation that combines supervised fine-tuning with verifier-guided reinforcement learning, using formal verification tools to provide feedback on syntax, deployability, and policy compliance. We curate two large, high-quality NL-to-IaC datasets, TF-Gen (152k instances) and TF-Mutn (52k instances), via multi-stage verification and iterative LLM self-correction. Evaluations against 17 state-of-the-art LLMs, including ~50x larger models like Sonnet 3.7, DeepSeek-R1, and GPT-4.1, show that TerraFormer improves correctness over its base LLM by 15.94% on IaC-Eval, 11.65% on TF-Gen (Test), and 19.60% on TF-Mutn (Test). It outperforms larger models on both TF-Gen…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Advanced Malware Detection Techniques