Two-stage Training for Learning from Label Proportions

Jiabin Liu; Bo Wang; Xin Shen; Zhiquan Qi; Yingjie Tian

arXiv:2105.10635·cs.LG·May 25, 2021

Two-stage Training for Learning from Label Proportions

Jiabin Liu, Bo Wang, Xin Shen, Zhiquan Qi, Yingjie Tian

PDF

Open Access

TL;DR

This paper proposes a two-stage training framework for learning from label proportions that improves instance-level classification accuracy by enforcing strict proportion consistency and reducing label noise.

Contribution

It introduces a constrained optimization approach with mixup and symmetric crossentropy to enhance existing deep LLP models, addressing issues of high-entropy distributions and proportion mismatch.

Findings

01

Significant performance improvements on benchmark datasets

02

Effective reduction of label noise and distribution mismatch

03

Framework is model-agnostic and applicable as a post-hoc phase

Abstract

Learning from label proportions (LLP) aims at learning an instance-level classifier with label proportions in grouped training data. Existing deep learning based LLP methods utilize end-to-end pipelines to obtain the proportional loss with Kullback-Leibler divergence between the bag-level prior and posterior class distributions. However, the unconstrained optimization on this objective can hardly reach a solution in accordance with the given proportions. Besides, concerning the probabilistic classifier, this strategy unavoidably results in high-entropy conditional class distributions at the instance level. These issues further degrade the performance of the instance-level classification. In this paper, we regard these problems as noisy pseudo labeling, and instead impose the strict proportion consistency on the classifier with a constrained optimization as a continuous training stage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Infrastructure Maintenance and Monitoring

MethodsMixup