FormalSpecCpp: A Dataset of C++ Formal Specifications created using LLMs

Madhurima Chakraborty; Peter Pirkelbauer; Qing Yi

arXiv:2502.15217·cs.SE·February 24, 2025

TL;DR

FormalSpecCpp is a dataset of C++ programs with formal specifications, created using LLMs, intended to support benchmarking and further work in program verification and specification inference.

Contribution

It introduces what the authors describe as the first comprehensive dataset of C++ programs with formal specifications, enabling evaluation of specification inference tools and fine-tuning of LLMs for automated specification generation.

Findings

01 Provides a structured benchmark for specification inference tools

02 Enables fine-tuning of LLMs for automated specification generation

03 Supports research in program verification and AI-assisted development

Abstract

FormalSpecCpp is a dataset designed to fill the gap in standardized benchmarks for verifying formal specifications in C++ programs. To the best of our knowledge, this is the first comprehensive collection of C++ programs with well-defined preconditions and postconditions. It provides a structured benchmark for evaluating specification inference tools and testing the accuracy of generated specifications. Researchers and developers can use this dataset to benchmark specification inference tools, fine-tune Large Language Models (LLMs) for automated specification generation, and analyze the role of formal specifications in improving program verification and automated testing. By making this dataset publicly available, we aim to advance research in program verification, specification inference, and AI-assisted software development. The dataset and the code are available at…
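To make "well-defined preconditions and postconditions" concrete, here is a minimal sketch of what a specified C++ function could look like. It is not taken from the dataset: the function `maxElement`, the comment layout, and the use of `assert` to check the conditions at run time are all assumptions made for illustration, and the dataset's actual annotation format may differ.

```cpp
#include <cassert>
#include <vector>

// Hypothetical example; not an entry from FormalSpecCpp.
//
// Precondition:  values is non-empty.
// Postcondition: the result is an element of values and is >= every element.
int maxElement(const std::vector<int>& values) {
    // Check the precondition on entry.
    assert(!values.empty());

    int result = values[0];
    for (int v : values) {
        if (v > result) {
            result = v;
        }
    }

    // Check the postcondition before returning.
    bool found = false;
    for (int v : values) {
        assert(result >= v);   // result bounds every element from above
        if (v == result) {
            found = true;      // result actually occurs in the input
        }
    }
    assert(found);

    return result;
}
```

A specification inference tool would, in this picture, be given only the function body and asked to recover conditions equivalent to the two comments; calling `maxElement({3, 9, 4})` satisfies the precondition, while calling it on an empty vector violates it and trips the first assertion.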

Code & Models

Repositories

MadhuNimmo/FormalSpecCpp (official)
