Harnessing Superclasses for Learning from Hierarchical Databases
Nicolas Urbani (Heudiasyc), Sylvain Rousseau (Heudiasyc), Yves, Grandvalet (Heudiasyc), Leonardo Tanzi (Polito)

TL;DR
This paper proposes a hierarchical loss function for classification tasks that leverages class hierarchies to improve accuracy and reduce coarse errors without additional computational cost.
Contribution
It introduces a proper scoring loss that incorporates class hierarchies, enabling consistent classification across different levels of granularity.
Findings
Improves accuracy across benchmarks
Reduces coarse classification errors
No significant computational overhead
Abstract
In many large-scale classification problems, classes are organized in a known hierarchy, typically represented as a tree expressing the inclusion of classes in superclasses. We introduce a loss for this type of supervised hierarchical classification. It utilizes the knowledge of the hierarchy to assign each example not only to a class but also to all encompassing superclasses. Applicable to any feedforward architecture with a softmax output layer, this loss is a proper scoring rule, in that its expectation is minimized by the true posterior class probabilities. This property allows us to simultaneously pursue consistent classification objectives between superclasses and fine-grained classes, and eliminates the need for a performance trade-off between different granularities. We conduct an experimental study on three reference benchmarks, in which we vary the size of the training sets to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning and Algorithms
MethodsSoftmax · Sparse Evolutionary Training
