Theoretical Proportion Label Perturbation for Learning from Label Proportions in Large Bags
Shunsuke Kubo, Shinnosuke Matsuo, Daiki Suehiro, Kazuhiro Terada, Hiroaki Ito, Akihiko Yoshizawa, Ryoma Bise

TL;DR
This paper introduces a novel perturbation method for label proportions in large bags within weakly supervised learning, enabling effective training despite GPU memory constraints and reducing overfitting.
Contribution
It proposes a proportion label perturbation technique based on the hypergeometric distribution and loss weighting to improve learning from large bags in LLP.
Findings
Achieves classification accuracy comparable to traditional methods
Effectively mitigates overfitting caused by mini-bag sampling
Enables training on large bags with limited GPU memory
Abstract
Learning from label proportions (LLP) is a form of weakly supervised learning that trains an instance-level classifier from the label proportions of bags, each of which is a set of instances, without using instance labels. A challenge in LLP arises when the number of instances in a bag (the bag size) is large, which makes traditional LLP methods difficult to apply because of GPU memory limitations. This study aims to develop an LLP method capable of learning from large bags. In our method, smaller bags (mini-bags) are generated by sampling instances from the large bags (original bags), and these mini-bags are used in place of the original bags. However, the proportion of a mini-bag is unknown and differs from that of the original bag, leading to overfitting. To address this issue, we propose a perturbation method for the proportion labels of sampled mini-bags to mitigate overfitting to noisy…
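The mini-bag sampling described above can be illustrated with a minimal sketch. It assumes that per-class instance counts for the original bag can be approximated from its proportion label and bag size, and it draws a mini-bag without replacement, so the resulting class counts follow a multivariate hypergeometric distribution. The function name `sample_mini_bag_proportion`, the example counts, and the mini-bag size are hypothetical and chosen only for illustration; the sketch does not reproduce the authors' implementation and omits the loss weighting mentioned in the contribution.

```python
import random
from collections import Counter


def sample_mini_bag_proportion(class_counts, mini_bag_size, rng):
    """Return the class proportions of one mini-bag drawn without replacement.

    class_counts is a hypothetical input: the number of instances per class
    in the original bag (e.g. round(proportion * bag_size) for each class).
    """
    # Build a pool holding one class index per instance of the original bag.
    pool = [k for k, n in enumerate(class_counts) for _ in range(n)]
    # Sampling without replacement gives multivariate hypergeometric counts.
    drawn = rng.sample(pool, mini_bag_size)
    counts = Counter(drawn)
    return [counts.get(k, 0) / mini_bag_size for k in range(len(class_counts))]


rng = random.Random(0)
original_counts = [600, 300, 100]  # bag of 1000 instances: 0.6 / 0.3 / 0.1
for _ in range(5):
    # Each draw is a plausible proportion label for a mini-bag of 20 instances.
    print(sample_mini_bag_proportion(original_counts, 20, rng))
```

Each printed vector sums to 1 but generally differs from the original proportion (0.6, 0.3, 0.1), which illustrates why the original bag's label is a noisy target for any single mini-bag.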
Taxonomy
Topics: Industrial Vision Systems and Defect Detection
