Table Tennis Stroke Recognition Using Two-Dimensional Human Pose Estimation
Kaustubh Milind Kulkarni, Sucheth Shenoy

TL;DR
This paper presents a new computer vision system using 2D human pose estimation and deep learning to accurately classify and analyze table tennis strokes, aiding performance improvement.
Contribution
It introduces a large, diverse dataset and a novel neural network model for stroke recognition, demonstrating high accuracy and generalization in table tennis analysis.
Findings
Validation accuracy of 99.37% for stroke classification
Model generalizes well with 98.72% accuracy on unseen player data
Benchmarking of multiple machine learning and deep learning approaches
Abstract
We introduce a novel method for collecting table tennis video data and perform stroke detection and classification. A diverse dataset containing video data of 11 basic strokes obtained from 14 professional table tennis players, summing up to a total of 22111 videos has been collected using the proposed setup. The temporal convolutional neural network model developed using 2D pose estimation performs multiclass classification of these 11 table tennis strokes with a validation accuracy of 99.37%. Moreover, the neural network generalizes well over the data of a player excluded from the training and validation dataset, classifying the fresh strokes with an overall best accuracy of 98.72%. Various model architectures using machine learning and deep learning based approaches have been trained for stroke recognition and their performances have been compared and benchmarked. Inferences such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
