Investigating Neuron Ablation in Attention Heads: The Case for Peak   Activation Centering

Nicholas Pochinkov; Ben Pasero; Skylar Shibayama

arXiv:2408.17322·cs.LG·September 2, 2024·2 cites

Investigating Neuron Ablation in Attention Heads: The Case for Peak Activation Centering

Nicholas Pochinkov, Ben Pasero, Skylar Shibayama

PDF

Open Access 1 Repo

TL;DR

This paper explores how different neuron ablation techniques affect transformer models' performance, introducing a novel 'peak ablation' method and analyzing its effectiveness in understanding attention mechanisms.

Contribution

It introduces 'peak ablation' as a new neuron ablation technique and compares it with existing methods to better interpret attention mechanisms in transformers.

Findings

01

Peak ablation often causes less performance degradation.

02

Resampling generally leads to more performance loss.

03

Different ablation methods vary in effectiveness across models.

Abstract

The use of transformer-based models is growing rapidly throughout society. With this growth, it is important to understand how they work, and in particular, how the attention mechanisms represent concepts. Though there are many interpretability methods, many look at models through their neuronal activations, which are poorly understood. We describe different lenses through which to view neuron activations, and investigate the effectiveness in language models and vision transformers through various methods of neural ablation: zero ablation, mean ablation, activation resampling, and a novel approach we term 'peak ablation'. Through experimental analysis, we find that in different regimes and models, each method can offer the lowest degradation of model performance compared to other methods, with resampling usually causing the most significant performance deterioration. We make our code…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nickypro/investigating-ablation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFunctional Brain Connectivity Studies

MethodsSoftmax · Attention Is All You Need