Learning from Label Proportions and Covariate-shifted Instances
Sagalpreet Singh, Navodita Sharma, Shreyas Havaldar, Rishi Saket, Aravindan Raghuveer

TL;DR
This paper introduces methods for learning at the instance level from bag-level labels and covariate-shifted source data, combining domain adaptation with label proportion learning to improve predictive accuracy.
Contribution
It develops hybrid LLP methods that incorporate source instance labels and target bag labels within a domain adaptation framework, with theoretical guarantees and empirical validation.
Findings
Outperforms LLP and domain adaptation baselines on multiple datasets
Provides theoretical bounds on target generalization error
Effectively leverages covariate-shifted source data for improved predictions
Abstract
In many applications, especially due to lack of supervision or privacy concerns, the training data is grouped into bags of instances (feature-vectors) and for each bag we have only an aggregate label derived from the instance-labels in the bag. In learning from label proportions (LLP) the aggregate label is the average of the instance-labels in a bag, and a significant body of work has focused on training models in the LLP setting to predict instance-labels. In practice however, the training data may have fully supervised albeit covariate-shifted source data, along with the usual target data with bag-labels, and we wish to train a good instance-level predictor on the target domain. We call this the covariate-shifted hybrid LLP problem. Fully supervised covariate shifted data often has useful training signals and the goal is to leverage them for better predictive performance in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuzzy Logic and Control Systems · Machine Learning and Data Classification
