Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning

Isaac Xu; Martin Gillis; Ayushi Sharma; Benjamin Misiuk; Craig J. Brown; Thomas Trappenberg

arXiv:2602.08986·cs.LG·February 10, 2026

Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning

Isaac Xu, Martin Gillis, Ayushi Sharma, Benjamin Misiuk, Craig J. Brown, Thomas Trappenberg

PDF

Open Access

TL;DR

This paper introduces a weighted loss function for hierarchical multi-label classification that improves detection of rare nodes, leading to significant gains in recall and F1 score, especially in challenging scenarios.

Contribution

It proposes a novel loss combining imbalance and focal weighting to better detect rare hierarchical nodes in neural networks.

Findings

01

Recall improved by up to five times on benchmarks.

02

Statistically significant F1 score gains.

03

Effective even with limited data or suboptimal encoders.

Abstract

In hierarchical multi-label classification, a persistent challenge is enabling model predictions to reach deeper levels of the hierarchy for more detailed or fine-grained classifications. This difficulty partly arises from the natural rarity of certain classes (or hierarchical nodes) and the hierarchical constraint that ensures child nodes are almost always less frequent than their parents. To address this, we propose a weighted loss objective for neural networks that combines node-wise imbalance weighting with focal weighting components, the latter leveraging modern quantification of ensemble uncertainties. By emphasizing rare nodes rather than rare observations (data points), and focusing on uncertain nodes for each model output distribution during training, we observe improvements in recall by up to a factor of five on benchmark datasets, along with statistically significant gains in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Imbalanced Data Classification Techniques · Topic Modeling