TT4D: A Pipeline and Dataset for Table Tennis 4D Reconstruction From Monocular Videos
Nima Rahmanian, Daniel Kienzle, Thomas Gossard, Dvij Kalaria, Rainer Lienhart, Shankar Sastry

TL;DR
TT4D introduces a comprehensive dataset and pipeline for 3D reconstruction of table tennis gameplay from monocular videos, enabling advanced analysis and virtual replay.
Contribution
The paper presents a novel lift-first reconstruction pipeline and a large-scale, high-fidelity dataset for table tennis from broadcast videos, overcoming occlusion challenges.
Findings
Successfully reconstructs 3D ball trajectories and spins from monocular videos.
Enables estimation of racket pose and velocity at impact.
Facilitates training of generative models for rally simulation.
Abstract
We present TT4D, a large-scale, high-fidelity table tennis dataset. It provides hours of reconstructed singles and doubles gameplay from monocular broadcast videos, featuring multimodal annotations like high-quality camera calibrations, precise 3D ball positions, ball spin, time segmentation, and 3D human meshes over time. This rich data provides a new foundation for virtual replay, in-depth player analysis, and robot learning. The dataset's combination of scale and precision is achieved through a novel reconstruction pipeline. Prior methods first partition a game sequence into individual shot segments based on the 2D ball track, and only then attempt reconstruction. However, 2D-based time segmentation collapses under occlusion and varied camera viewpoints, preventing reliable reconstruction. We invert this paradigm by first lifting the entire unsegmented 2D ball track to 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
