Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning
Isaac Xu, Martin Gillis, Ayushi Sharma, Benjamin Misiuk, Craig J. Brown, Thomas Trappenberg

TL;DR
This paper introduces a weighted loss function for hierarchical multi-label classification that improves detection of rare nodes, leading to significant gains in recall and F1 score, especially in challenging scenarios.
Contribution
It proposes a novel loss combining imbalance and focal weighting to better detect rare hierarchical nodes in neural networks.
Findings
Recall improved by up to five times on benchmarks.
Statistically significant F1 score gains.
Effective even with limited data or suboptimal encoders.
Abstract
In hierarchical multi-label classification, a persistent challenge is enabling model predictions to reach deeper levels of the hierarchy for more detailed or fine-grained classifications. This difficulty partly arises from the natural rarity of certain classes (or hierarchical nodes) and the hierarchical constraint that ensures child nodes are almost always less frequent than their parents. To address this, we propose a weighted loss objective for neural networks that combines node-wise imbalance weighting with focal weighting components, the latter leveraging modern quantification of ensemble uncertainties. By emphasizing rare nodes rather than rare observations (data points), and focusing on uncertain nodes for each model output distribution during training, we observe improvements in recall by up to a factor of five on benchmark datasets, along with statistically significant gains in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Imbalanced Data Classification Techniques · Topic Modeling
