Improving Prototypical Visual Explanations with Reward Reweighing,   Reselection, and Retraining

Aaron J. Li; Robin Netzorg; Zhihan Cheng; Zhuoqin Zhang; Bin Yu

arXiv:2307.03887·cs.LG·June 5, 2024·2 cites

Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining

Aaron J. Li, Robin Netzorg, Zhihan Cheng, Zhuoqin Zhang, Bin Yu

PDF

Open Access 1 Repo

TL;DR

This paper introduces the R3 framework that enhances the interpretability and accuracy of ProtoPNet by using human feedback to reweigh, reseleect, and retrain the model post hoc.

Contribution

The paper presents a novel offline post-processing method that improves prototype alignment with human preferences and boosts interpretability and accuracy of ProtoPNet.

Findings

01

R3 improves interpretability of ProtoPNet.

02

R3 enhances predictive accuracy.

03

R3 aligns prototypes with human preferences.

Abstract

In recent years, work has gone into developing deep interpretable methods for image classification that clearly attributes a model's output to specific features of the data. One such of these methods is the Prototypical Part Network (ProtoPNet), which attempts to classify images based on meaningful parts of the input. While this architecture is able to produce visually interpretable classifications, it often learns to classify based on parts of the image that are not semantically meaningful. To address this problem, we propose the Reward Reweighing, Reselecting, and Retraining (R3) post-processing framework, which performs three additional corrective updates to a pretrained ProtoPNet in an offline and efficient manner. The first two steps involve learning a reward model based on collected human feedback and then aligning the prototypes with human preferences. The final step is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aaron-jx-li/r3-protopnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics

MethodsBalanced Selection · ALIGN