Boundary-Recovering Network for Temporal Action Detection
Jihwan Kim, Jaehyun Choi, Yerim Jeon, Jae-Pil Heo

TL;DR
The paper introduces the Boundary-Recovering Network (BRN), which enhances temporal action detection by addressing boundary ambiguity and the vanishing boundary problem through a novel scale-time feature construction and exchange mechanism.
Contribution
BRN introduces a new scale-time feature construction and exchange method to recover ambiguous action boundaries in temporal detection tasks.
Findings
Outperforms state-of-the-art on ActivityNet-v1.3 and THUMOS14
Reduces the vanishing boundary problem significantly
Improves boundary localization accuracy
Abstract
Temporal action detection (TAD) is challenging, yet fundamental for real-world video applications. Large temporal scale variation of actions is one of the most primary difficulties in TAD. Naturally, multi-scale features have potential in localizing actions of diverse lengths as widely used in object detection. Nevertheless, unlike objects in images, actions have more ambiguity in their boundaries. That is, small neighboring objects are not considered as a large one while short adjoining actions can be misunderstood as a long one. In the coarse-to-fine feature pyramid via pooling, these vague action boundaries can fade out, which we call 'vanishing boundary problem'. To this end, we propose Boundary-Recovering Network (BRN) to address the vanishing boundary problem. BRN constructs scale-time features by introducing a new axis called scale dimension by interpolating multi-scale features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods
