Hypernym Bias: Unraveling Deep Classifier Training Dynamics through the Lens of Class Hierarchy
Roman Malashin, Valeria Yachnaya, Alexander Mullin

TL;DR
This paper explores how deep classifiers learn hierarchical class relationships during training, revealing that networks first distinguish broad categories before refining to specific ones, with implications for understanding deep learning dynamics.
Contribution
It introduces a novel framework to track class hierarchy evolution during training, linking neural collapse properties to hierarchical learning stages.
Findings
Networks distinguish hypernyms early in training
Feature manifolds evolve to reflect class hierarchy
Neural collapse properties appear earlier in hypernym space
Abstract
We investigate the training dynamics of deep classifiers by examining how hierarchical relationships between classes evolve during training. Through extensive experiments, we argue that the learning process in classification problems can be understood through the lens of label clustering. Specifically, we observe that networks tend to distinguish higher-level (hypernym) categories in the early stages of training, and learn more specific (hyponym) categories later. We introduce a novel framework to track the evolution of the feature manifold during training, revealing how the hierarchy of class relations emerges and refines across the network layers. Our analysis demonstrates that the learned representations closely align with the semantic structure of the dataset, providing a quantitative description of the clustering process. Notably, we show that in the hypernym label space, certain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques
MethodsALIGN
