ProtoS-ViT: Visual foundation models for sparse self-explainable   classifications

Hugues Turb\'e; Mina Bjelogrlic; Gianmarco Mengaldo; Christian Lovis

arXiv:2406.10025·cs.CV·December 17, 2024·2 cites

ProtoS-ViT: Visual foundation models for sparse self-explainable classifications

Hugues Turb\'e, Mina Bjelogrlic, Gianmarco Mengaldo, Christian Lovis

PDF

Open Access 1 Repo

TL;DR

ProtoS-ViT introduces a new architecture that enhances explainability in visual classification by leveraging pre-trained Vision Transformers and a comprehensive set of evaluation metrics, advancing the development of transparent AI models.

Contribution

The paper proposes a novel architecture that improves explanation quality in prototypical networks using frozen ViT backbones and introduces extensive metrics for evaluating explanation effectiveness.

Findings

01

Outperforms existing prototypical models in explanation quality

02

Provides compact, human-recognizable explanations

03

Effective for both general and biomedical image classification

Abstract

Prototypical networks aim to build intrinsically explainable models based on the linear summation of concepts. Concepts are coherent entities that we, as humans, can recognize and associate with a certain object or entity. However, important challenges remain in the fair evaluation of explanation quality provided by these models. This work first proposes an extensive set of quantitative and qualitative metrics which allow to identify drawbacks in current prototypical networks. It then introduces a novel architecture which provides compact explanations, outperforming current prototypical models in terms of explanation quality. Overall, the proposed architecture demonstrates how frozen pre-trained ViT backbones can be effectively turned into prototypical models for both general and domain-specific tasks, in our case biomedical image classifiers. Code is available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hturbe/protosvit
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Image Retrieval and Classification Techniques · Machine Learning and Data Classification

MethodsSparse Evolutionary Training