Proto2Proto: Can you recognize the car, the way I do?
Monish Keswani, Sriranjani Ramakrishnan, Nishant Reddy, Vineeth N, Balasubramanian

TL;DR
Proto2Proto introduces a novel knowledge distillation approach that transfers interpretability from a teacher prototypical network to a student, enhancing transparency while maintaining competitive accuracy.
Contribution
The paper proposes two new loss functions and metrics to effectively transfer interpretability between prototypical networks through knowledge distillation.
Findings
Successful interpretability transfer demonstrated on CUB-200-2011 and Stanford Cars datasets.
Student models achieve interpretability comparable to teachers with competitive performance.
New metrics effectively measure interpretability transfer quality.
Abstract
Prototypical methods have recently gained a lot of attention due to their intrinsic interpretable nature, which is obtained through the prototypes. With growing use cases of model reuse and distillation, there is a need to also study transfer of interpretability from one model to another. We present Proto2Proto, a novel method to transfer interpretability of one prototypical part network to another via knowledge distillation. Our approach aims to add interpretability to the "dark" knowledge transferred from the teacher to the shallower student model. We propose two novel losses: "Global Explanation" loss and "Patch-Prototype Correspondence" loss to facilitate such a transfer. Global Explanation loss forces the student prototypes to be close to teacher prototypes, and Patch-Prototype Correspondence loss enforces the local representations of the student to be similar to that of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
