Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and Distillation
Zhaoyu Liu, Kan Jiang, Murong Ma, Zhe Hou, Yun Lin, Jin Song Dong

TL;DR
This paper introduces UMEG-Net, a novel graph-based model that effectively performs precise event spotting in sports videos with limited labeled data by integrating multi-entity graphs and knowledge distillation.
Contribution
The paper presents a unified multi-entity graph network that combines human skeletons and object keypoints, along with multimodal distillation, to improve few-shot event spotting performance.
Findings
Outperforms baseline models in few-shot scenarios
Achieves robust performance with limited labeled data
Provides a scalable solution for precise event spotting
Abstract
Precise event spotting (PES) aims to recognize fine-grained events at exact moments and has become a key component of sports analytics. This task is particularly challenging due to rapid succession, motion blur, and subtle visual differences. Consequently, most existing methods rely on domain-specific, end-to-end training with large labeled datasets and often struggle in few-shot conditions due to their dependence on pixel- or pose-based inputs alone. However, obtaining large labeled datasets is practically hard. We propose a Unified Multi-Entity Graph Network (UMEG-Net) for few-shot PES. UMEG-Net integrates human skeletons and sport-specific object keypoints into a unified graph and features an efficient spatio-temporal extraction module based on advanced GCN and multi-scale temporal shift. To further enhance performance, we employ multimodal distillation to transfer knowledge from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Video Analysis and Summarization · Anomaly Detection Techniques and Applications
