Explaining Cross-Domain Recognition with Interpretable Deep Classifier
Yiheng Zhang, Ting Yao, Zhaofan Qiu, Tao Mei

TL;DR
This paper introduces an Interpretable Deep Classifier (IDC) that explains cross-domain recognition by identifying influential source samples, enhancing interpretability without sacrificing accuracy, and enabling effective domain adaptation with minimal data.
Contribution
The paper proposes a novel IDC model with a differentiable memory bank for interpretability in domain adaptation, maintaining accuracy while providing explanations.
Findings
IDC improves model interpretability with minimal accuracy loss.
Using IDC-selected data achieves better results with less training data.
IDC enhances domain adaptation by identifying key source samples.
Abstract
The recent advances in deep learning predominantly construct models in their internal representations, and it is opaque to explain the rationale behind and decisions to human users. Such explainability is especially essential for domain adaptation, whose challenges require developing more adaptive models across different domains. In this paper, we ask the question: how much each sample in source domain contributes to the network's prediction on the samples from target domain. To address this, we devise a novel Interpretable Deep Classifier (IDC) that learns the nearest source samples of a target sample as evidence upon which the classifier makes the decision. Technically, IDC maintains a differentiable memory bank for each category and the memory slot derives a form of key-value pair. The key records the features of discriminative source samples and the value stores the corresponding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Domain Adaptation and Few-Shot Learning
