Temporal Pyramid Pooling Based Convolutional Neural Networks for Action Recognition
Peng Wang, Yuanzhouhan Cao, Chunhua Shen, Lingqiao Liu, Heng Tao Shen

TL;DR
This paper introduces a novel CNN architecture with temporal pyramid pooling for action recognition in videos, allowing arbitrary frame input and improving performance with less training data.
Contribution
The proposed network enables processing variable-length video inputs using temporal pyramid pooling, avoiding frame sampling and reducing training data requirements.
Findings
Outperforms state-of-the-art on Hollywood2 and HMDB51 datasets.
Requires fewer training data due to leveraging pre-trained image CNNs.
Effectively captures temporal information through pyramid pooling.
Abstract
Encouraged by the success of Convolutional Neural Networks (CNNs) in image classification, recently much effort is spent on applying CNNs to video based action recognition problems. One challenge is that video contains a varying number of frames which is incompatible to the standard input format of CNNs. Existing methods handle this issue either by directly sampling a fixed number of frames or bypassing this issue by introducing a 3D convolutional layer which conducts convolution in spatial-temporal domain. To solve this issue, here we propose a novel network structure which allows an arbitrary number of frames as the network input. The key of our solution is to introduce a module consisting of an encoding layer and a temporal pyramid pooling layer. The encoding layer maps the activation from previous layers to a feature vector suitable for pooling while the temporal pyramid pooling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Diabetic Foot Ulcer Assessment and Management
MethodsConvolution
