On Initial Pools for Deep Active Learning
Akshay L Chandra, Sai Vikas Desai, Chaitanya Devaguptapu, Vineeth N, Balasubramanian

TL;DR
This paper investigates whether using intelligently sampled initial labeled pools, especially with self-supervised methods, can enhance deep active learning performance, but results are inconclusive except for promising VAE-based strategies.
Contribution
It explores the impact of self-supervised and unsupervised sampling strategies for initial pools in deep active learning, a less-studied aspect compared to query functions.
Findings
VAE-based initial pool sampling shows promising trends
No conclusive evidence that intelligent sampling outperforms random sampling in the long run
Experimental setup and methodology were peer-reviewed before conducting experiments
Abstract
Active Learning (AL) techniques aim to minimize the training data required to train a model for a given task. Pool-based AL techniques start with a small initial labeled pool and then iteratively pick batches of the most informative samples for labeling. Generally, the initial pool is sampled randomly and labeled to seed the AL iterations. While recent studies have focused on evaluating the robustness of various query functions in AL, little to no attention has been given to the design of the initial labeled pool for deep active learning. Given the recent successes of learning representations in self-supervised/unsupervised ways, we study if an intelligently sampled initial labeled pool can improve deep AL performance. We investigate the effect of intelligently sampled initial labeled pools, including the use of self-supervised and unsupervised strategies, on deep AL methods. The setup,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Computability, Logic, AI Algorithms
