Sports Video Analysis on Large-Scale Data
Dekun Wu, He Zhao, Xingce Bao, Richard P. Wildes

TL;DR
This paper introduces a large-scale NBA sports video dataset and a unified transformer-based approach for captioning, action recognition, and player identification, addressing previous data and annotation limitations.
Contribution
The paper presents a novel large-scale NBA dataset (NSVA) and a unified feature processing method with minimal labeling, enabling multiple sports video analysis tasks.
Findings
Transformer-based models achieve strong performance on NSVA tasks.
The dataset facilitates research in sports video captioning and action recognition.
Minimal labeling approach reduces annotation efforts significantly.
Abstract
This paper investigates the modeling of automated machine description on sports video, which has seen much progress recently. Nevertheless, state-of-the-art approaches fall quite short of capturing how human experts analyze sports scenes. There are several major reasons: (1) The used dataset is collected from non-official providers, which naturally creates a gap between models trained on those datasets and real-world applications; (2) previously proposed methods require extensive annotation efforts (i.e., player and ball segmentation at pixel level) on localizing useful visual features to yield acceptable results; (3) very few public datasets are available. In this paper, we propose a novel large-scale NBA dataset for Sports Video Analysis (NSVA) with a focus on captioning, to address the above challenges. We also design a unified approach to process raw videos into a stack of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Human Pose and Action Recognition · Sports Analytics and Performance
