Soccer on Your Tabletop
Konstantinos Rematas, Ira Kemelmacher-Shlizerman, Brian Curless, Steve, Seitz

TL;DR
This paper introduces a system that converts monocular soccer videos into interactive 3D reconstructions, enabling immersive viewing and AR experiences by estimating player depth using a CNN trained on game-derived data.
Contribution
The paper presents a novel CNN-based depth estimation method for soccer players, trained on synthetic data, to achieve accurate 3D reconstructions from monocular videos.
Findings
Effective depth estimation on synthetic and real footage
Improved 3D reconstruction quality over existing methods
Enables interactive and AR viewing of soccer games
Abstract
We present a system that transforms a monocular video of a soccer game into a moving 3D reconstruction, in which the players and field can be rendered interactively with a 3D viewer or through an Augmented Reality device. At the heart of our paper is an approach to estimate the depth map of each player, using a CNN that is trained on 3D player data extracted from soccer video games. We compare with state of the art body pose and depth estimation techniques, and show results on both synthetic ground truth benchmarks, and real YouTube soccer footage.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Video Analysis and Summarization · Advanced Image Processing Techniques
