A Computer Vision Framework for Multi-Class Detection and Tracking in Soccer Broadcast Footage
Daniel Tshiani

TL;DR
This paper presents a computer vision system that detects and tracks players, referees, and the ball in soccer broadcast footage, enabling affordable and scalable sports analytics without expensive multi-camera setups.
Contribution
It introduces an end-to-end single-camera pipeline combining YOLO and ByteTrack for soccer player and official tracking, demonstrating its effectiveness for lower-budget teams.
Findings
High accuracy in detecting and tracking players and officials
Ball detection remains a significant challenge
Potential to democratize soccer analytics using standard broadcast footage
Abstract
Clubs with access to expensive multi-camera setups or GPS tracking systems gain a competitive advantage through detailed data, whereas lower-budget teams are often unable to collect similar information. This paper examines whether such data can instead be extracted directly from standard broadcast footage using a single-camera computer vision pipeline. This project develops an end-to-end system that combines a YOLO object detector with the ByteTrack tracking algorithm to identify and track players, referees, goalkeepers, and the ball throughout a match. Experimental results show that the pipeline achieves high performance in detecting and tracking players and officials, with strong precision, recall, and mAP50 scores, while ball detection remains the primary challenge. Despite this limitation, our findings demonstrate that AI can extract meaningful player-level spatial information from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
