Running Event Visualization using Videos from Multiple Cameras
Yeshwanth Napolean, Priadi Teguh Wibowo, Jan van Gemert

TL;DR
This paper presents a novel approach for tracking multiple runners across videos from different cameras during a marathon, using scene text detection and person re-identification without ground truth supervision.
Contribution
It introduces a combined method leveraging scene text detection and re-identification for athlete tracking without relying on labeled training data.
Findings
Scene text detection achieved an F1-score of 74.
Combining text detection with re-identification improved F1-score to 85.8.
Re-training re-identification with identified inliers further increased F1-score to 87.8.
Abstract
Visualizing the trajectory of multiple runners with videos collected at different points in a race could be useful for sports performance analysis. The videos and the trajectories can also aid in athlete health monitoring. While the runners unique ID and their appearance are distinct, the task is not straightforward because the video data does not contain explicit information as to which runners appear in each of the videos. There is no direct supervision of the model in tracking athletes, only filtering steps to remove irrelevant detections. Other factors of concern include occlusion of runners and harsh illumination. To this end, we identify two methods for runner identification at different points of the event, for determining their trajectory. One is scene text detection which recognizes the runners by detecting a unique 'bib number' attached to their clothes and the other is person…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
