Rethinking deep active learning: Using unlabeled data at model training

Oriane Sim\'eoni; Mateusz Budnik; Yannis Avrithis; Guillaume Gravier

arXiv:1911.08177·cs.CV·November 20, 2019

Rethinking deep active learning: Using unlabeled data at model training

Oriane Sim\'eoni, Mateusz Budnik, Yannis Avrithis, Guillaume Gravier

PDF

Open Access 1 Repo

TL;DR

This paper proposes a novel active learning approach that incorporates unlabeled data during training through unsupervised and semi-supervised methods, leading to significant accuracy improvements in image classification.

Contribution

It introduces the use of unlabeled data during training in active learning, combining unsupervised feature learning and semi-supervised learning, which has not been extensively studied before.

Findings

01

Unlabeled data during training improves accuracy significantly.

02

Semi-supervised learning benefits are consistent across datasets.

03

Effective even with minimal labels, such as one per class.

Abstract

Active learning typically focuses on training a model on few labeled examples alone, while unlabeled ones are only used for acquisition. In this work we depart from this setting by using both labeled and unlabeled data during model training across active learning cycles. We do so by using unsupervised feature learning at the beginning of the active learning pipeline and semi-supervised learning at every active learning cycle, on all available data. The former has not been investigated before in active learning, while the study of latter in the context of deep learning is scarce and recent findings are not conclusive with respect to its benefit. Our idea is orthogonal to acquisition strategies by using more data, much like ensemble methods use more models. By systematically evaluating on a number of popular acquisition strategies and datasets, we find that the use of unlabeled data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

osimeoni/RethinkingDeepActiveLearning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning