Extracting Label-specific Key Input Features for Neural Code Intelligence Models
Md Rafiqul Islam Rabin

TL;DR
This paper introduces a syntax-guided program reduction technique for neural code models, improving explainability by identifying label-specific key features and enhancing trust in predictions.
Contribution
It proposes a syntax-guided reduction method that outperforms syntax-unaware approaches, revealing more meaningful features for model interpretability.
Findings
Syntax-guided reduction reduces input program size more effectively.
Reduced programs contain more label-specific key features.
Key features are more vulnerable to adversarial token renaming.
Abstract
The code intelligence (CI) models are often black-box and do not offer any insights on the input features that they learn for making correct predictions. This opacity may lead to distrust in their prediction and hamper their wider adoption in safety-critical applications. In recent, the program reduction technique is widely being used to identify key input features in order to explain the prediction of CI models. The approach removes irrelevant parts from an input program and keeps the minimal snippets that a CI model needs to maintain its prediction. However, the state-of-the-art approaches mainly use a syntax-unaware program reduction technique that does not follow the syntax of programs, which adds significant overhead to the reduction of input programs and explainability of models. In this paper, we apply a syntax-guided program reduction technique that follows the syntax of input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Software Engineering Research · Ferroelectric and Negative Capacitance Devices
