Learning from Hard Labels with Additional Supervision on Non-Hard-Labeled Classes
Kosuke Sugiyama, Masato Uchida

TL;DR
This paper develops a theoretical framework for understanding how additional supervision, like confidence scores, can improve classification models by refining soft labels, especially in data-scarce scenarios.
Contribution
It introduces a novel theoretical analysis showing the importance of distribution information over non-hard-labeled classes and how additional supervision enhances generalization.
Findings
Additional supervision influences the direction of soft label refinement.
The mixing coefficient controls the step size in label adjustment.
Designing effective additional supervision improves classification accuracy.
Abstract
In scenarios where training data is limited due to observation costs or data scarcity, enriching the label information associated with each instance becomes crucial for building high-accuracy classification models. In such contexts, it is often feasible to obtain not only hard labels but also {\it additional supervision}, such as the confidences for the hard labels. This setting naturally raises fundamental questions: {\it What kinds of additional supervision are intrinsically beneficial?} And {\it how do they contribute to improved generalization performance?} To address these questions, we propose a theoretical framework that treats both hard labels and additional supervision as probability distributions, and constructs soft labels through their affine combination. Our theoretical analysis reveals that the essential component of additional supervision is not the confidence score of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Educational Technology and Assessment
