Robin: A Novel Method to Produce Robust Interpreters for Deep   Learning-Based Code Classifiers

Zhen Li; Ruqian Zhang; Deqing Zou; Ning Wang; Yating Li; Shouhuai Xu,; Chen Chen; and Hai Jin

arXiv:2309.10644·cs.SE·September 20, 2023

Robin: A Novel Method to Produce Robust Interpreters for Deep Learning-Based Code Classifiers

Zhen Li, Ruqian Zhang, Deqing Zou, Ning Wang, Yating Li, Shouhuai Xu,, Chen Chen, and Hai Jin

PDF

Open Access 1 Repo

TL;DR

Robin introduces a hybrid approach combining an interpreter and two approximators, leveraging adversarial training and data augmentation to produce robust, high-fidelity explanations for deep learning-based code classifiers, especially under out-of-distribution conditions.

Contribution

Robin is a novel method that creates robust interpreters for deep code classifiers by integrating hybrid structures with adversarial training and data augmentation techniques.

Findings

01

Robin achieves 6.11% higher fidelity on classifiers.

02

Robin's interpreters are 67.22% more faithful to approximators.

03

Robin's approach is 15.87x more robust against out-of-distribution examples.

Abstract

Deep learning has been widely used in source code classification tasks, such as code classification according to their functionalities, code authorship attribution, and vulnerability detection. Unfortunately, the black-box nature of deep learning makes it hard to interpret and understand why a classifier (i.e., classification model) makes a particular prediction on a given example. This lack of interpretability (or explainability) might have hindered their adoption by practitioners because it is not clear when they should or should not trust a classifier's prediction. The lack of interpretability has motivated a number of studies in recent years. However, existing methods are neither robust nor able to cope with out-of-distribution examples. In this paper, we propose a novel method to produce \underline{Rob}ust \underline{in}terpreters for a given deep learning-based code classifier;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cgcl-codes/robin
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling