Identifying Critical Tokens for Accurate Predictions in   Transformer-based Medical Imaging Models

Solha Kang; Joris Vankerschaver; Utku Ozbulak

arXiv:2501.15452·cs.CV·January 28, 2025

Identifying Critical Tokens for Accurate Predictions in Transformer-based Medical Imaging Models

Solha Kang, Joris Vankerschaver, Utku Ozbulak

PDF

TL;DR

This paper introduces Token Insight, a method to identify critical tokens in transformer-based medical imaging models, enhancing interpretability and trust without extra modules, demonstrated on polyp detection tasks.

Contribution

The paper presents a novel, module-free approach to interpret transformer models by identifying important tokens, applicable across various transformer architectures in medical imaging.

Findings

01

Token Insight effectively highlights influential tokens in medical images.

02

The method improves transparency and interpretability of transformer models.

03

Experimental results show increased trust in AI-based medical diagnoses.

Abstract

With the advancements in self-supervised learning (SSL), transformer-based computer vision models have recently demonstrated superior results compared to convolutional neural networks (CNNs) and are poised to dominate the field of artificial intelligence (AI)-based medical imaging in the upcoming years. Nevertheless, similar to CNNs, unveiling the decision-making process of transformer-based models remains a challenge. In this work, we take a step towards demystifying the decision-making process of transformer-based medical imaging models and propose Token Insight, a novel method that identifies the critical tokens that contribute to the prediction made by the model. Our method relies on the principled approach of token discarding native to transformer-based models, requires no additional module, and can be applied to any transformer model. Using the proposed approach, we quantify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.