Towards Autoformalization of LLM-generated Outputs for Requirement Verification

Mihir Gupte; Ramesh S

arXiv:2511.11829·cs.CL·November 18, 2025

Towards Autoformalization of LLM-generated Outputs for Requirement Verification

Mihir Gupte, Ramesh S

PDF

Open Access

TL;DR

This paper explores using a simple LLM-based autoformalizer to verify the logical correctness of LLM-generated outputs from natural language requirements, aiming to improve requirement verification processes.

Contribution

It introduces a preliminary approach for autoformalization-based verification of LLM outputs, demonstrating potential for consistency and correctness checks.

Findings

01

Autoformalizer identified logical equivalence between differently-worded requirements.

02

Autoformalizer detected logical inconsistency between requirement and LLM output.

03

Results suggest autoformalization can enhance fidelity of LLM-generated formal outputs.

Abstract

Autoformalization, the process of translating informal statements into formal logic, has gained renewed interest with the emergence of powerful Large Language Models (LLMs). While LLMs show promise in generating structured outputs from natural language (NL), such as Gherkin Scenarios from NL feature requirements, there's currently no formal method to verify if these outputs are accurate. This paper takes a preliminary step toward addressing this gap by exploring the use of a simple LLM-based autoformalizer to verify LLM-generated outputs against a small set of natural language requirements. We conducted two distinct experiments. In the first one, the autoformalizer successfully identified that two differently-worded NL requirements were logically equivalent, demonstrating the pipeline's potential for consistency checks. In the second, the autoformalizer was used to identify a logical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Natural Language Processing Techniques · Topic Modeling