Estimation of Reliable Proposal Quality for Temporal Action Detection

Junshan Hu; Chaoxu guo; Liansheng Zhuang; Biao Wang; Tiezheng Ge,; Yuning Jiang; Houqiang Li

arXiv:2204.11695·cs.CV·November 22, 2022

Estimation of Reliable Proposal Quality for Temporal Action Detection

Junshan Hu, Chaoxu guo, Liansheng Zhuang, Biao Wang, Tiezheng Ge,, Yuning Jiang, Houqiang Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach for temporal action detection that aligns classification and localization tasks by estimating proposal quality from moment and region perspectives, leading to state-of-the-art results.

Contribution

It proposes Boundary Evaluate Module and Region Evaluate Module to improve proposal quality estimation, addressing temporal misalignment in anchor-free TAD methods.

Findings

01

Improves THUMOS14 mAP by 3.6% to 63.6%.

02

Enhances ActivityNet-1.3 mAP to 36.2%.

03

Modules are compatible with existing frameworks.

Abstract

Temporal action detection (TAD) aims to locate and recognize the actions in an untrimmed video. Anchor-free methods have made remarkable progress which mainly formulate TAD into two tasks: classification and localization using two separate branches. This paper reveals the temporal misalignment between the two tasks hindering further progress. To address this, we propose a new method that gives insights into moment and region perspectives simultaneously to align the two tasks by acquiring reliable proposal quality. For the moment perspective, Boundary Evaluate Module (BEM) is designed which focuses on local appearance and motion evolvement to estimate boundary quality and adopts a multi-scale manner to deal with varied action durations. For the region perspective, we introduce Region Evaluate Module (REM) which uses a new and efficient sampling method for proposal feature representation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

junshan233/brem
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications

MethodsALIGN