The Malignant Tail: Spectral Segregation of Label Noise in Over-Parameterized Networks
Zice Wang

TL;DR
This paper investigates how over-parameterized neural networks segregate signal and noise in spectral space, revealing a failure mode called the Malignant Tail that affects generalization under label noise.
Contribution
It introduces the concept of the Malignant Tail, demonstrates spectral segregation of noise and signal during training, and proposes spectral truncation as a post-hoc method to improve robustness.
Findings
SGD biases noise toward high-frequency orthogonal subspaces.
Spectral truncation effectively prunes noise and improves generalization.
Excess spectral capacity can be a structural liability under label noise.
Abstract
While implicit regularization facilitates benign overfitting in low-noise regimes, recent theoretical work predicts a sharp phase transition to harmful overfitting as the noise-to-signal ratio increases. We experimentally isolate the geometric mechanism of this transition: the Malignant Tail, a failure mode where networks functionally segregate signal and noise, reducing coherent semantic features into low-rank subspaces while pushing stochastic label noise into high-frequency orthogonal components, distinct from systematic or corruption-aligned noise. Through a Spectral Linear Probe of training dynamics, we demonstrate that Stochastic Gradient Descent (SGD) fails to suppress this noise, instead implicitly biasing it toward high-frequency orthogonal subspaces, effectively preserving signal-noise separability. We show that this geometric separation is distinct from simple variance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
