Robin: A Novel Method to Produce Robust Interpreters for Deep Learning-Based Code Classifiers
Zhen Li, Ruqian Zhang, Deqing Zou, Ning Wang, Yating Li, Shouhuai Xu,, Chen Chen, and Hai Jin

TL;DR
Robin introduces a hybrid approach combining an interpreter and two approximators, leveraging adversarial training and data augmentation to produce robust, high-fidelity explanations for deep learning-based code classifiers, especially under out-of-distribution conditions.
Contribution
Robin is a novel method that creates robust interpreters for deep code classifiers by integrating hybrid structures with adversarial training and data augmentation techniques.
Findings
Robin achieves 6.11% higher fidelity on classifiers.
Robin's interpreters are 67.22% more faithful to approximators.
Robin's approach is 15.87x more robust against out-of-distribution examples.
Abstract
Deep learning has been widely used in source code classification tasks, such as code classification according to their functionalities, code authorship attribution, and vulnerability detection. Unfortunately, the black-box nature of deep learning makes it hard to interpret and understand why a classifier (i.e., classification model) makes a particular prediction on a given example. This lack of interpretability (or explainability) might have hindered their adoption by practitioners because it is not clear when they should or should not trust a classifier's prediction. The lack of interpretability has motivated a number of studies in recent years. However, existing methods are neither robust nor able to cope with out-of-distribution examples. In this paper, we propose a novel method to produce \underline{Rob}ust \underline{in}terpreters for a given deep learning-based code classifier;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling
