Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan, Andrew Zisserman

TL;DR
This paper introduces a two-stream ConvNet architecture for video action recognition, combining spatial and temporal information, and demonstrates its effectiveness on standard benchmarks with improved performance.
Contribution
The paper presents a novel two-stream ConvNet architecture that integrates spatial and temporal data, and employs multi-task learning to enhance action recognition performance.
Findings
Achieved competitive results on UCF-101 and HMDB-51 benchmarks.
ConvNet trained on dense optical flow performs well with limited data.
Multi-task learning improves accuracy across datasets.
Abstract
We investigate architectures of discriminatively trained deep Convolutional Networks (ConvNets) for action recognition in video. The challenge is to capture the complementary information on appearance from still frames and motion between frames. We also aim to generalise the best performing hand-crafted features within a data-driven learning framework. Our contribution is three-fold. First, we propose a two-stream ConvNet architecture which incorporates spatial and temporal networks. Second, we demonstrate that a ConvNet trained on multi-frame dense optical flow is able to achieve very good performance in spite of limited training data. Finally, we show that multi-task learning, applied to two different action classification datasets, can be used to increase the amount of training data and improve the performance on both. Our architecture is trained and evaluated on the standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods
