Multi-Temporal Convolutions for Human Action Recognition in Videos

Alexandros Stergiou; Ronald Poppe

arXiv:2011.03949·cs.CV·April 1, 2021

Multi-Temporal Convolutions for Human Action Recognition in Videos

Alexandros Stergiou, Ronald Poppe

PDF

1 Repo

TL;DR

This paper introduces multi-temporal convolution blocks for video action recognition, enabling CNNs to capture actions at various time scales efficiently, with reduced computational costs and competitive accuracy.

Contribution

The paper proposes a novel multi-temporal convolution (MTConv) block that extracts features at multiple temporal resolutions and aligns them efficiently within 3D-CNNs.

Findings

01

Achieves competitive accuracy on Kinetics, Moments in Time, and HACS datasets.

02

Reduces computational costs significantly compared to state-of-the-art methods.

03

Demonstrates effective multi-scale temporal feature extraction in video recognition.

Abstract

Effective extraction of temporal patterns is crucial for the recognition of temporally varying actions in video. We argue that the fixed-sized spatio-temporal convolution kernels used in convolutional neural networks (CNNs) can be improved to extract informative motions that are executed at different time scales. To address this challenge, we present a novel spatio-temporal convolution block that is capable of extracting spatio-temporal patterns at multiple temporal resolutions. Our proposed multi-temporal convolution (MTConv) blocks utilize two branches that focus on brief and prolonged spatio-temporal patterns, respectively. The extracted time-varying features are aligned in a third branch, with respect to global motion patterns through recurrent cells. The proposed blocks are lightweight and can be integrated into any 3D-CNN architecture. This introduces a substantial reduction in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alexandrosstergiou/Squeeze-and-Recursion-Temporal-Gates
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methods3D Convolution · Convolution