Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization
Ding Li, Xuebing Yang, Yongqiang Tang, Chenyang Zhang, Wensheng, Zhang

TL;DR
This paper introduces AL-STAL, an active learning approach for semi-supervised temporal action localization that uses uncertainty-based scoring functions to select informative video samples, reducing labeling costs while maintaining high performance.
Contribution
It proposes a novel active learning framework with two scoring functions, TPE and TCI, specifically designed for semi-supervised TAL, improving sample selection efficiency.
Findings
AL-STAL outperforms existing methods on benchmark datasets.
The proposed scoring functions effectively identify informative samples.
AL-STAL achieves competitive results compared to fully-supervised approaches.
Abstract
Temporal Action Localization (TAL) aims to predict both action category and temporal boundary of action instances in untrimmed videos, i.e., start and end time. Fully-supervised solutions are usually adopted in most existing works, and proven to be effective. One of the practical bottlenecks in these solutions is the large amount of labeled training data required. To reduce expensive human label cost, this paper focuses on a rarely investigated yet practical task named semi-supervised TAL and proposes an effective active learning method, named AL-STAL. We leverage four steps for actively selecting video samples with high informativeness and training the localization model, named \emph{Train, Query, Annotate, Append}. Two scoring functions that consider the uncertainty of localization model are equipped in AL-STAL, thus facilitating the video sample rank and selection. One takes entropy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications
