Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Lorenzo Basile; Valentino Maiorca; Diego Doimo; Francesco Locatello; Alberto Cazzaniga

arXiv:2510.21518·cs.CV·January 15, 2026

Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Lorenzo Basile, Valentino Maiorca, Diego Doimo, Francesco Locatello, Alberto Cazzaniga

PDF

TL;DR

This paper investigates how individual attention heads in multimodal transformers specialize in specific semantic or visual attributes, providing tools for understanding and editing model behavior.

Contribution

It introduces a signal processing-based interpretability method to identify and manipulate attention heads responsible for specific concepts in multimodal models.

Findings

01

Attention heads show consistent specialization patterns across tasks.

02

Editing 1% of heads can control targeted concepts in outputs.

03

The approach applies to language and vision-language tasks.

Abstract

Language and vision-language models have shown impressive performance across a wide range of tasks, but their internal mechanisms remain only partly understood. In this work, we study how individual attention heads in text-generative models specialize in specific semantic or visual attributes. Building on an established interpretability method, we reinterpret the practice of probing intermediate activations with the final decoding layer through the lens of signal processing. This lets us analyze multiple samples in a principled way and rank attention heads based on their relevance to target concepts. Our results show consistent patterns of specialization at the head level across both unimodal and multimodal transformers. Remarkably, we find that editing as few as 1% of the heads, selected using our method, can reliably suppress or enhance targeted concepts in the model output. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.