Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization
Chen Ju, Peisen Zhao, Siheng Chen, Ya Zhang, Xiaoyun Zhang, Qi Tian

TL;DR
This paper introduces an adaptive mutual supervision framework for weakly-supervised temporal action localization, effectively addressing the incompleteness of class activation sequences and improving localization accuracy through iterative mutual enhancement.
Contribution
The proposed AMS framework innovatively combines two branches with adaptive sampling and mutual pseudo-label supervision to better localize complete action regions.
Findings
Significantly outperforms state-of-the-art on THUMOS14.
Effective in localizing both discriminative and less discriminative action regions.
Iterative mutual supervision enhances localization completeness.
Abstract
Weakly-supervised temporal action localization aims to localize actions in untrimmed videos with only video-level action category labels. Most of previous methods ignore the incompleteness issue of Class Activation Sequences (CAS), suffering from trivial localization results. To solve this issue, we introduce an adaptive mutual supervision framework (AMS) with two branches, where the base branch adopts CAS to localize the most discriminative action regions, while the supplementary branch localizes the less discriminative action regions through a novel adaptive sampler. The adaptive sampler dynamically updates the input of the supplementary branch with a sampling weight sequence negatively correlated with the CAS from the base branch, thereby prompting the supplementary branch to localize the action regions underestimated by the base branch. To promote mutual enhancement between these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Stroke Rehabilitation and Recovery
