Finding Action Tubes with a Sparse-to-Dense Framework
Yuxi Li, Weiyao Lin, Tao Wang, John See, Rui Qian, Ning Xu, Limin, Wang, Shugong Xu

TL;DR
This paper introduces an efficient sparse-to-dense framework for spatial-temporal action detection that leverages long-term and short-term information, significantly improving efficiency while maintaining competitive accuracy.
Contribution
The paper presents a novel sparse-to-dense framework with a dynamic feature sampling module for action tube detection, enhancing efficiency and long-term information utilization.
Findings
Achieved competitive results on UCF101-24, JHMDB-21, and UCFSports datasets.
Framework is approximately 7.6 times more efficient than previous methods.
Effectively utilizes both long-term and short-term video information.
Abstract
The task of spatial-temporal action detection has attracted increasing attention among researchers. Existing dominant methods solve this problem by relying on short-term information and dense serial-wise detection on each individual frames or clips. Despite their effectiveness, these methods showed inadequate use of long-term information and are prone to inefficiency. In this paper, we propose for the first time, an efficient framework that generates action tube proposals from video streams with a single forward pass in a sparse-to-dense manner. There are two key characteristics in this framework: (1) Both long-term and short-term sampled information are explicitly utilized in our spatiotemporal network, (2) A new dynamic feature sampling module (DTS) is designed to effectively approximate the tube output while keeping the system tractable. We evaluate the efficacy of our model on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
