Probing Model Signal-Awareness via Prediction-Preserving Input   Minimization

Sahil Suneja; Yunhui Zheng; Yufan Zhuang; Jim Laredo; Alessandro; Morari

arXiv:2011.14934·cs.SE·June 24, 2021

Probing Model Signal-Awareness via Prediction-Preserving Input Minimization

Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Jim Laredo, Alessandro, Morari

PDF

TL;DR

This paper introduces a method called prediction-preserving input minimization (P2IM) to evaluate how well AI models for source code vulnerability detection rely on true vulnerability signals, revealing potential over-reliance on dataset noise.

Contribution

The paper proposes P2IM and a new metric, Signal-aware Recall (SAR), to systematically assess model signal-awareness in source code understanding tasks.

Findings

01

Models show a significant drop in Signal-aware Recall when minimal code snippets are used.

02

P2IM uncovers reliance on dataset noise rather than true vulnerability signals.

03

SAR provides a data-driven measure of a model's signal-awareness.

Abstract

This work explores the signal awareness of AI models for source code understanding. Using a software vulnerability detection use case, we evaluate the models' ability to capture the correct vulnerability signals to produce their predictions. Our prediction-preserving input minimization (P2IM) approach systematically reduces the original source code to a minimal snippet which a model needs to maintain its prediction. The model's reliance on incorrect signals is then uncovered when the vulnerability in the original code is missing in the minimal snippet, both of which the model however predicts as being vulnerable. We measure the signal awareness of models using a new metric we propose- Signal-aware Recall (SAR). We apply P2IM on three different neural network architectures across multiple datasets. The results show a sharp drop in the model's Recall from the high 90s to sub-60s with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.