Extracting Label-specific Key Input Features for Neural Code   Intelligence Models

Md Rafiqul Islam Rabin

arXiv:2202.06474·cs.SE·February 15, 2022

Extracting Label-specific Key Input Features for Neural Code Intelligence Models

Md Rafiqul Islam Rabin

PDF

Open Access 2 Repos

TL;DR

This paper introduces a syntax-guided program reduction technique for neural code models, improving explainability by identifying label-specific key features and enhancing trust in predictions.

Contribution

It proposes a syntax-guided reduction method that outperforms syntax-unaware approaches, revealing more meaningful features for model interpretability.

Findings

01

Syntax-guided reduction reduces input program size more effectively.

02

Reduced programs contain more label-specific key features.

03

Key features are more vulnerable to adversarial token renaming.

Abstract

The code intelligence (CI) models are often black-box and do not offer any insights on the input features that they learn for making correct predictions. This opacity may lead to distrust in their prediction and hamper their wider adoption in safety-critical applications. In recent, the program reduction technique is widely being used to identify key input features in order to explain the prediction of CI models. The approach removes irrelevant parts from an input program and keeps the minimal snippets that a CI model needs to maintain its prediction. However, the state-of-the-art approaches mainly use a syntax-unaware program reduction technique that does not follow the syntax of programs, which adds significant overhead to the reduction of input programs and explainability of models. In this paper, we apply a syntax-guided program reduction technique that follows the syntax of input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Software Engineering Research · Ferroelectric and Negative Capacitance Devices