Evaluating Pre-Trained Models for Multi-Language Vulnerability Patching

Zanis Ali Khan; Aayush Garg; Yuejun Guo; Qiang Tang

arXiv:2501.07339·cs.SE·January 14, 2025

Evaluating Pre-Trained Models for Multi-Language Vulnerability Patching

Zanis Ali Khan, Aayush Garg, Yuejun Guo, Qiang Tang

PDF

TL;DR

This paper evaluates pre-trained models CodeBERT and CodeT5 for automated vulnerability patching across multiple languages, highlighting their strengths, limitations, and potential for security applications.

Contribution

It provides a comprehensive benchmark of these models' performance on vulnerability patching, revealing insights into their accuracy, efficiency, and challenges with patch length.

Findings

01

CodeT5 achieves higher accuracy on complex vulnerability datasets.

02

CodeBERT performs well with fragmented or limited context data.

03

Both models struggle as patch length increases.

Abstract

Software vulnerabilities pose critical security risks, demanding prompt and effective mitigation strategies. While advancements in Automated Program Repair (APR) have primarily targeted general software bugs, the domain of vulnerability patching, which is a security-critical subset of APR, remains underexplored. This paper investigates the potential of pre-trained language models, CodeBERT and CodeT5, for automated vulnerability patching across diverse datasets and five programming languages. We evaluate these models on their accuracy, computational efficiency, and how the length of vulnerable code patches impacts performance. Our findings reveal promising accuracy levels, particularly for CodeT5 on datasets with complex vulnerability patterns, while CodeBERT demonstrates strengths in handling fragmented or context-limited datasets. CodeT5 further showcases superior efficiency, making…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.