TL;DR
This paper introduces a CNN-based method that leverages temporal information from image sequences to improve soccer ball detection and tracking, especially under occlusion or disappearance scenarios.
Contribution
It proposes a novel approach combining spatial and temporal convolutional models to enhance detection accuracy and robustness in video sequences.
Findings
Temporal models outperform single-frame detection methods.
Using historical frames improves tracking during occlusion.
The approach achieves high precision and recall in challenging conditions.
Abstract
Soccer ball detection is identified as one of the critical challenges in the RoboCup competition. It requires an efficient vision system capable of handling the task of detection with high precision and recall and providing robust and low inference time. In this work, we present a novel convolutional neural network (CNN) approach to detect the soccer ball in an image sequence. In contrast to the existing methods where only the current frame or an image is used for the detection, we make use of the history of frames. Using history allows to efficiently track the ball in situations where the ball disappears or gets partially occluded in some of the frames. Our approach exploits spatio-temporal correlation and detects the ball based on the trajectory of its movements. We present our results with three convolutional methods, namely temporal convolutional networks (TCN), ConvLSTM, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Sigmoid Activation · Convolution · ConvLSTM
