Hypernym Bias: Unraveling Deep Classifier Training Dynamics through the   Lens of Class Hierarchy

Roman Malashin; Valeria Yachnaya; Alexander Mullin

arXiv:2502.12125·cs.AI·February 18, 2025

Hypernym Bias: Unraveling Deep Classifier Training Dynamics through the Lens of Class Hierarchy

Roman Malashin, Valeria Yachnaya, Alexander Mullin

PDF

Open Access

TL;DR

This paper explores how deep classifiers learn hierarchical class relationships during training, revealing that networks first distinguish broad categories before refining to specific ones, with implications for understanding deep learning dynamics.

Contribution

It introduces a novel framework to track class hierarchy evolution during training, linking neural collapse properties to hierarchical learning stages.

Findings

01

Networks distinguish hypernyms early in training

02

Feature manifolds evolve to reflect class hierarchy

03

Neural collapse properties appear earlier in hypernym space

Abstract

We investigate the training dynamics of deep classifiers by examining how hierarchical relationships between classes evolve during training. Through extensive experiments, we argue that the learning process in classification problems can be understood through the lens of label clustering. Specifically, we observe that networks tend to distinguish higher-level (hypernym) categories in the early stages of training, and learn more specific (hyponym) categories later. We introduce a novel framework to track the evolution of the feature manifold during training, revealing how the hierarchy of class relations emerges and refines across the network layers. Our analysis demonstrates that the learned representations closely align with the semantic structure of the dataset, providing a quantitative description of the clustering process. Notably, we show that in the hypernym label space, certain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques

MethodsALIGN