DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization
Xiaojun Tang, Junsong Fan, Chuanchen Luo, Zhaoxiang Zhang, Man Zhang,, and Zongyuan Yang

TL;DR
DDG-Net introduces a discriminability-driven graph network that explicitly models ambiguous and discriminative snippets to improve weakly-supervised temporal action localization, achieving state-of-the-art results.
Contribution
The paper proposes DDG-Net, a novel graph network that enhances snippet discriminability by modeling ambiguous information and employs a feature consistency loss.
Findings
Achieves new state-of-the-art on THUMOS14 and ActivityNet1.2
Effectively models ambiguous and discriminative snippets
Improves snippet representation discriminability
Abstract
Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task. Due to large-scale datasets, most existing methods use a network pretrained in other datasets to extract features, which are not suitable enough for WTAL. To address this problem, researchers design several modules for feature enhancement, which improve the performance of the localization module, especially modeling the temporal relationship between snippets. However, all of them neglect the adverse effects of ambiguous information, which would reduce the discriminability of others. Considering this phenomenon, we propose Discriminability-Driven Graph Network (DDG-Net), which explicitly models ambiguous snippets and discriminative snippets with well-designed connections, preventing the transmission of ambiguous information and enhancing the discriminability of snippet-level representations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Context-Aware Activity Recognition Systems
MethodsConvolution
