ContraFix: Agentic Vulnerability Repair via Differential Runtime Evidence and Skill Reuse
Simiao Liu, Fang Liu, Li Zhang, Yang Liu, Yinghao Zhu

TL;DR
ContraFix is a novel framework that improves vulnerability repair in large language model agents by combining differential runtime evidence with reusable repair skills, leading to higher success rates.
Contribution
It introduces a new agentic AVR framework that leverages differential runtime evidence and skill reuse, significantly enhancing repair accuracy and efficiency.
Findings
Achieves 84.0% repair success on SEC-Bench with GPT-5-mini.
Resolves 73.8% of tasks on PatchEval across multiple languages.
Outperforms state-of-the-art baselines while reducing computational costs.
Abstract
Large language model (LLM) agents are increasingly used for automated vulnerability repair (AVR), where repository-level reasoning enables them to inspect context and produce source-code patches. However, recent empirical results show that these agents still struggle with real-world vulnerabilities. Their main failure mode is semantic misunderstanding: choosing a repair direction that does not match the root cause. We identify two reasons for this gap. Existing agents usually reason from the failing execution alone. A crash report can pinpoint where the program failed, but it does not reveal which variable or state transition, among many candidates near the fault site, separates the crashing behavior from safe execution. As a result, agents often produce symptom-oriented patches instead of causal fixes. Moreover, evidence collected for one vulnerability is rarely retained, so similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
