Making Your First Choice: To Address Cold Start Problem in Vision Active Learning
Liangyu Chen, Yutong Bai, Siyu Huang, Yongyi Lu, Bihan Wen, Alan L., Yuille, Zongwei Zhou

TL;DR
This paper addresses the cold start problem in vision active learning by leveraging contrastive learning to improve initial data selection, outperforming existing strategies and random selection across multiple datasets.
Contribution
It introduces a contrastive learning-based method to effectively select initial data in vision active learning, mitigating bias and outliers without requiring annotations.
Findings
Initial query outperforms existing active querying strategies.
Proposed method surpasses random selection by a large margin.
Effective across diverse datasets including medical imaging.
Abstract
Active learning promises to improve annotation efficiency by iteratively selecting the most important data to be annotated first. However, we uncover a striking contradiction to this promise: active learning fails to select data as efficiently as random selection at the first few choices. We identify this as the cold start problem in vision active learning, caused by a biased and outlier initial query. This paper seeks to address the cold start problem by exploiting the three advantages of contrastive learning: (1) no annotation is required; (2) label diversity is ensured by pseudo-labels to mitigate bias; (3) typical data is determined by contrastive features to reduce outliers. Experiments are conducted on CIFAR-10-LT and three medical imaging datasets (i.e. Colon Pathology, Abdominal CT, and Blood Cell Microscope). Our initial query not only significantly outperforms existing active…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
