3D Convolutional Networks for Action Recognition: Application to Sport   Gesture Recognition

Pierre-Etienne Martin (LaBRI; MPI-EVA; UB); J Benois-Pineau; R; P\'eteri; A Zemmari; J Morlier

arXiv:2204.08460·cs.CV·April 20, 2022·Multi-faceted Deep Learning

3D Convolutional Networks for Action Recognition: Application to Sport Gesture Recognition

Pierre-Etienne Martin (LaBRI, MPI-EVA, UB), J Benois-Pineau, R, P\'eteri, A Zemmari, J Morlier

PDF

TL;DR

This paper explores the use of 3D convolutional networks for classifying continuous sports videos, specifically focusing on actions like table tennis strokes, demonstrating their effectiveness in complex, real-world environments.

Contribution

It applies 3D convolutional networks to continuous, ecological sports videos for action recognition, highlighting their utility in challenging segmentation and classification tasks.

Findings

01

Effective segmentation and classification of sports actions

02

3D convnets handle ecological environment videos well

03

Window-based approaches improve recognition accuracy

Abstract

3D convolutional networks is a good means to perform tasks such as video segmentation into coherent spatio-temporal chunks and classification of them with regard to a target taxonomy. In the chapter we are interested in the classification of continuous video takes with repeatable actions, such as strokes of table tennis. Filmed in a free marker less ecological environment, these videos represent a challenge from both segmentation and classification point of view. The 3D convnets are an efficient tool for solving these problems with window-based approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.