An X-Ray Is Worth 15 Features: Sparse Autoencoders for Interpretable   Radiology Report Generation

Ahmed Abdulaal; Hugo Fry; Nina Monta\~na-Brown; Ayodeji Ijishakin,; Jack Gao; Stephanie Hyland; Daniel C. Alexander; Daniel C. Castro

arXiv:2410.03334·cs.CV·October 7, 2024·2 cites

An X-Ray Is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation

Ahmed Abdulaal, Hugo Fry, Nina Monta\~na-Brown, Ayodeji Ijishakin,, Jack Gao, Stephanie Hyland, Daniel C. Alexander, Daniel C. Castro

PDF

Open Access

TL;DR

SAE-Rad introduces sparse autoencoders to decompose vision transformer features into interpretable radiology report components, achieving competitive results with less computational cost and improved interpretability.

Contribution

This work is the first to apply mechanistic interpretability techniques to a multi-modal reasoning task in radiology report generation.

Findings

01

Achieves competitive radiology metrics on MIMIC-CXR

02

Uses significantly less training resources

03

Learns meaningful visual concepts aligned with expert interpretations

Abstract

Radiological services are experiencing unprecedented demand, leading to increased interest in automating radiology report generation. Existing Vision-Language Models (VLMs) suffer from hallucinations, lack interpretability, and require expensive fine-tuning. We introduce SAE-Rad, which uses sparse autoencoders (SAEs) to decompose latent representations from a pre-trained vision transformer into human-interpretable features. Our hybrid architecture combines state-of-the-art SAE advancements, achieving accurate latent reconstructions while maintaining sparsity. Using an off-the-shelf language model, we distil ground-truth reports into radiological descriptions for each SAE feature, which we then compile into a full report for each image, eliminating the need for fine-tuning large models for this task. To the best of our knowledge, SAE-Rad represents the first instance of using mechanistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Radiomics and Machine Learning in Medical Imaging · Natural Language Processing Techniques

MethodsLinear Layer · Softmax · Multi-Head Attention · Layer Normalization · Dense Connections · Residual Connection · Attention Is All You Need · Vision Transformer