Making Better Use of Unlabelled Data in Bayesian Active Learning

Freddie Bickford Smith; Adam Foster; Tom Rainforth

arXiv:2404.17249·cs.LG·April 29, 2024

Making Better Use of Unlabelled Data in Bayesian Active Learning

Freddie Bickford Smith, Adam Foster, Tom Rainforth

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised Bayesian active learning framework that leverages unlabelled data to improve model performance and data acquisition decisions, outperforming traditional methods and being more scalable.

Contribution

It presents a simple, scalable semi-supervised approach for Bayesian active learning that effectively utilizes unlabelled data, enhancing model accuracy and acquisition efficiency.

Findings

01

Better model performance than traditional Bayesian active learning.

02

More scalable than conventional semi-supervised approaches.

03

Highlights importance of joint study of models and acquisition methods.

Abstract

Fully supervised models are predominant in Bayesian active learning. We argue that their neglect of the information present in unlabelled data harms not just predictive performance but also decisions about what data to acquire. Our proposed solution is a simple framework for semi-supervised Bayesian active learning. We find it produces better-performing models than either conventional Bayesian active learning or semi-supervised learning with randomly acquired data. It is also easier to scale up than the conventional approach. As well as supporting a shift towards semi-supervised models, our findings highlight the importance of studying models and acquisition methods in conjunction.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fbickfordsmith/epig
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Analytical Chemistry and Chromatography