Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives

Karolina Seweryn; Anna Wr\'oblewska; Szymon {\L}ukasik

arXiv:2309.12067·cs.CV·November 17, 2025

Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives

Karolina Seweryn, Anna Wr\'oblewska, Szymon {\L}ukasik

PDF

TL;DR

This survey reviews recent advances in soccer action recognition, spotting, and localization, emphasizing multimodal methods, datasets, and future research directions to improve model accuracy and robustness.

Contribution

It provides a comprehensive overview of current methods, datasets, and challenges in soccer action scene understanding, highlighting multimodal approaches and open research questions.

Findings

01

Deep learning methods dominate recent approaches

02

Multimodal data improves recognition accuracy

03

Open challenges include data scarcity and real-time processing

Abstract

Action scene understanding in soccer is a challenging task due to the complex and dynamic nature of the game, as well as the interactions between players. This article provides a comprehensive overview of this task divided into action recognition, spotting, and spatio-temporal action localization, with a particular emphasis on the modalities used and multimodal methods. We explore the publicly available data sources and metrics used to evaluate models' performance. The article reviews recent state-of-the-art methods that leverage deep learning techniques and traditional methods. We focus on multimodal methods, which integrate information from multiple sources, such as video and audio data, and also those that represent one source in various ways. The advantages and limitations of methods are discussed, along with their potential for improving the accuracy and robustness of models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus