Supernodes and Halos: Loss-Critical Hubs in LLM Feed-Forward Layers
Audrey Cherilyn, Houman Safaai

TL;DR
This paper identifies a small set of loss-critical channels, called supernodes, in transformer feed-forward layers that are crucial for model performance and can be protected during pruning.
Contribution
It introduces a Fisher-style loss proxy to locate supernodes, revealing their importance and organization, and demonstrates their significance through pruning experiments across multiple models.
Findings
Loss sensitivity is concentrated in a small set of channels within each layer.
Protecting supernodes during pruning significantly improves model retention of performance.
The supernode organization pattern persists across different models and increases during pretraining.
Abstract
We study the organization of channel-level importance in transformer feed-forward networks (FFNs). Using a Fisher-style loss proxy (LP) based on activation-gradient second moments, we show that loss sensitivity is concentrated in a small set of channels within each layer. In Llama-3.1-8B, the top 1% of channels per layer accounts for a median of 58.7% of LP mass, with a range of 33.0% to 86.1%. We call these loss-critical channels supernodes. Although FFN layers also contain strong activation outliers, LP-defined supernodes overlap only weakly with activation-defined outliers and are not explained by activation power or weight norms alone. Around this core, we find a weaker but consistent halo structure: some non-supernode channels share the supernodes' write support and show stronger redundancy with the protected core. We use one-shot structured FFN pruning as a diagnostic test of this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
