D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations
Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan,, Ming-Hsuan Yang, Ling Shao

TL;DR
D2-Net introduces a weakly-supervised framework for temporal action localization that enhances discriminability and robustness of activations through novel loss functions, significantly improving performance on benchmark datasets.
Contribution
The paper presents a new loss formulation combining discriminative and denoising components, improving weakly-supervised action localization accuracy.
Findings
Achieves up to 2.3% mAP improvement on THUMOS14
Outperforms existing methods on multiple benchmarks
Effectively suppresses background noise in activations
Abstract
This work proposes a weakly-supervised temporal action localization framework, called D2-Net, which strives to temporally localize actions using video-level supervision. Our main contribution is the introduction of a novel loss formulation, which jointly enhances the discriminability of latent embeddings and robustness of the output temporal class activations with respect to foreground-background noise caused by weak supervision. The proposed formulation comprises a discriminative and a denoising loss term for enhancing temporal action localization. The discriminative term incorporates a classification loss and utilizes a top-down attention mechanism to enhance the separability of latent foreground-background embeddings. The denoising loss term explicitly addresses the foreground-background noise in class activations by simultaneously maximizing intra-video and inter-video mutual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Stroke Rehabilitation and Recovery
