AdaSpot: Spend Resolution Where It Matters for Precise Event Spotting
Artur Xarles, Sergio Escalera, Thomas B. Moeslund, Albert Clap\'es

TL;DR
AdaSpot introduces an adaptive, efficient framework for precise event spotting in videos by selectively processing regions of interest at high resolution, significantly improving accuracy and efficiency over existing methods.
Contribution
It proposes a novel adaptive processing framework that selectively enhances regions of interest in low-resolution videos for precise event localization, maintaining efficiency and spatio-temporal consistency.
Findings
Achieves state-of-the-art performance on PES benchmarks.
Maintains high accuracy with marginal computational overhead.
Outperforms uniform high-resolution processing in efficiency.
Abstract
Precise Event Spotting aims to localize fast-paced actions or events in videos with high temporal precision, a key task for applications in sports analytics, robotics, and autonomous systems. Existing methods typically process all frames uniformly, overlooking the inherent spatio-temporal redundancy in video data. This leads to redundant computation on non-informative regions while limiting overall efficiency. To remain tractable, they often spatially downsample inputs, losing fine-grained details crucial for precise localization. To address these limitations, we propose \textbf{AdaSpot}, a simple yet effective framework that processes low-resolution videos to extract global task-relevant features while adaptively selecting the most informative region-of-interest in each frame for high-resolution processing. The selection is performed via an unsupervised, task-aware strategy that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis · Robot Manipulation and Learning
