Pre-trained Model-based Actionable Warning Identification: A Feasibility   Study

Xiuting Ge; Chunrong Fang; Quanjun Zhang; Daoyuan Wu and; Bowen Yu; Qirui Zheng; An Guo; Shangwei Lin; Zhihong Zhao and; Yang Liu; Zhenyu Chen

arXiv:2403.02716·cs.SE·March 6, 2024·1 cites

Pre-trained Model-based Actionable Warning Identification: A Feasibility Study

Xiuting Ge, Chunrong Fang, Quanjun Zhang, Daoyuan Wu and, Bowen Yu, Qirui Zheng, An Guo, Shangwei Lin, Zhihong Zhao and, Yang Liu, Zhenyu Chen

PDF

Open Access

TL;DR

This study evaluates the feasibility of using pre-trained models for actionable warning identification in static code analysis, demonstrating significant performance improvements over traditional machine learning methods.

Contribution

First systematic investigation of PTMs for AWI, including extensive evaluation, analysis of workflow factors, and practical guidelines for future improvements.

Findings

01

PTMs outperform state-of-the-art ML-based approaches by 9.85% to 21.12%.

02

Impact of data preprocessing, training, and prediction on PTM performance analyzed.

03

Identified reasons for PTM underperformance and provided practical enhancement guidelines.

Abstract

Actionable Warning Identification (AWI) plays a pivotal role in improving the usability of static code analyzers. Currently, Machine Learning (ML)-based AWI approaches, which mainly learn an AWI classifier from labeled warnings, are notably common. However, these approaches still face the problem of restricted performance due to the direct reliance on a limited number of labeled warnings to develop a classifier. Very recently, Pre-Trained Models (PTMs), which have been trained through billions of text/code tokens and demonstrated substantial success applications on various code-related tasks, could potentially circumvent the above problem. Nevertheless, the performance of PTMs on AWI has not been systematically investigated, leaving a gap in understanding their pros and cons. In this paper, we are the first to explore the feasibility of applying various PTMs for AWI. By conducting the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Safety Analysis