Semi-Supervised Active Learning with Temporal Output Discrepancy
Siyu Huang, Tianyang Wang, Haoyi Xiong, Jun Huan, Dejing Dou

TL;DR
This paper introduces a novel active learning method using Temporal Output Discrepancy (TOD) to select informative samples based on output differences across model states, reducing annotation costs in deep learning tasks.
Contribution
It proposes a simple, efficient, and task-agnostic active learning approach leveraging TOD to estimate sample loss and improve data selection for labeling.
Findings
Outperforms state-of-the-art active learning methods on image classification.
Effective in semantic segmentation tasks.
Enhances model performance with less labeled data.
Abstract
While deep learning succeeds in a wide range of tasks, it highly depends on the massive collection of annotated data which is expensive and time-consuming. To lower the cost of data annotation, active learning has been proposed to interactively query an oracle to annotate a small proportion of informative samples in an unlabeled dataset. Inspired by the fact that the samples with higher loss are usually more informative to the model than the samples with lower loss, in this paper we present a novel deep active learning approach that queries the oracle for data annotation when the unlabeled sample is believed to incorporate high loss. The core of our approach is a measurement Temporal Output Discrepancy (TOD) that estimates the sample loss by evaluating the discrepancy of outputs given by models at different optimization steps. Our theoretical investigation shows that TOD lower-bounds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · COVID-19 diagnosis using AI
