Temporal Action Proposal Generation with Background Constraint
Haosen Yang, Wenhao Wu, Lining Wang, Sheng Jin, Boyang Xia, Hongxun, Yao, Hujie Huang

TL;DR
This paper introduces a Background Constraint Network (BCNet) that enhances temporal action proposal generation by leveraging background prediction scores to suppress low-quality proposals, improving performance on benchmark datasets.
Contribution
The paper proposes a novel Background Constraint idea and a BCNet model that incorporate background information to improve proposal confidence evaluation in TAPG.
Findings
Outperforms state-of-the-art methods on ActivityNet-1.3 and THUMOS14.
Effectively suppresses low-quality proposals using background scores.
Achieves remarkable results in temporal action localization when combined with existing classifiers.
Abstract
Temporal action proposal generation (TAPG) is a challenging task that aims to locate action instances in untrimmed videos with temporal boundaries. To evaluate the confidence of proposals, the existing works typically predict action score of proposals that are supervised by the temporal Intersection-over-Union (tIoU) between proposal and the ground-truth. In this paper, we innovatively propose a general auxiliary Background Constraint idea to further suppress low-quality proposals, by utilizing the background prediction score to restrict the confidence of proposals. In this way, the Background Constraint concept can be easily plug-and-played into existing TAPG methods (e.g., BMN, GTAD). From this perspective, we propose the Background Constraint Network (BCNet) to further take advantage of the rich information of action and background. Specifically, we introduce an Action-Background…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Video Analysis and Summarization · Multimodal Machine Learning Applications
