Estimation of Reliable Proposal Quality for Temporal Action Detection
Junshan Hu, Chaoxu guo, Liansheng Zhuang, Biao Wang, Tiezheng Ge,, Yuning Jiang, Houqiang Li

TL;DR
This paper introduces a novel approach for temporal action detection that aligns classification and localization tasks by estimating proposal quality from moment and region perspectives, leading to state-of-the-art results.
Contribution
It proposes Boundary Evaluate Module and Region Evaluate Module to improve proposal quality estimation, addressing temporal misalignment in anchor-free TAD methods.
Findings
Improves THUMOS14 mAP by 3.6% to 63.6%.
Enhances ActivityNet-1.3 mAP to 36.2%.
Modules are compatible with existing frameworks.
Abstract
Temporal action detection (TAD) aims to locate and recognize the actions in an untrimmed video. Anchor-free methods have made remarkable progress which mainly formulate TAD into two tasks: classification and localization using two separate branches. This paper reveals the temporal misalignment between the two tasks hindering further progress. To address this, we propose a new method that gives insights into moment and region perspectives simultaneously to align the two tasks by acquiring reliable proposal quality. For the moment perspective, Boundary Evaluate Module (BEM) is designed which focuses on local appearance and motion evolvement to estimate boundary quality and adopts a multi-scale manner to deal with varied action durations. For the region perspective, we introduce Region Evaluate Module (REM) which uses a new and efficient sampling method for proposal feature representation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications
MethodsALIGN
