Towards General Loop Invariant Generation: A Benchmark of Programs with   Memory Manipulation

Chang Liu; Xiwei Wu; Yuan Feng; Qinxiang Cao; Junchi Yan

arXiv:2311.10483·cs.PL·June 10, 2024·1 cites

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation

Chang Liu, Xiwei Wu, Yuan Feng, Qinxiang Cao, Junchi Yan

PDF

Open Access

TL;DR

This paper introduces a new benchmark for programs with complex data structures and memory manipulations, revealing limitations of existing methods and proposing a novel LLM-SE framework that outperforms current approaches in automated loop invariant generation.

Contribution

The paper presents LIG-MM, a comprehensive benchmark for complex programs, and introduces LLM-SE, a novel framework combining large language models with symbolic execution for improved invariant generation.

Findings

01

Previous methods fail to automate verification on LIG-MM programs.

02

LLM-SE outperforms state-of-the-art methods on the benchmark.

03

The benchmark reveals gaps in current invariant generation techniques.

Abstract

Program verification is vital for ensuring software reliability, especially in the context of increasingly complex systems. Loop invariants, remaining true before and after each iteration of loops, are crucial for this verification process. Traditional provers and machine learning based methods for generating loop invariants often require expert intervention or extensive labeled data, and typically only handle numerical property verification. These methods struggle with programs involving complex data structures and memory manipulations, limiting their applicability and automation capabilities. In this paper, we introduce a new benchmark named LIG-MM, specifically for programs with complex data structures and memory manipulations. We collect 312 programs from various sources, including daily programs from college homework, the international competition (SV-COMP), benchmarks from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Parallel Computing and Optimization Techniques · Security and Verification in Computing