Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization
Weiqi Sun, Rui Su, Qian Yu, Dong Xu

TL;DR
This paper introduces SMEN, a framework that enhances weakly supervised temporal action localization by effectively capturing slow-motion action segments, addressing the challenge of detecting slow-paced actions in videos.
Contribution
The paper proposes a novel SMEN framework with a mining module and a localization module to improve sensitivity to slow-motion actions in WTAL models.
Findings
SMEN improves slow-motion action detection accuracy.
Framework is adaptable to existing WTAL networks.
Experimental results show high performance on benchmarks.
Abstract
Weakly supervised temporal action localization (WTAL) aims to localize actions in untrimmed videos with only weak supervision information (e.g. video-level labels). Most existing models handle all input videos with a fixed temporal scale. However, such models are not sensitive to actions whose pace of the movements is different from the ``normal" speed, especially slow-motion action instances, which complete the movements with a much slower speed than their counterparts with a normal speed. Here arises the slow-motion blurred issue: It is hard to explore salient slow-motion information from videos at ``normal" speed. In this paper, we propose a novel framework termed Slow Motion Enhanced Network (SMEN) to improve the ability of a WTAL network by compensating its sensitivity on slow-motion action segments. The proposed SMEN comprises a Mining module and a Localization module. The mining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
