Rethinking Temporal Object Detection from Robotic Perspectives
Xingyu Chen, Zhengxing Wu, Junzhi Yu, Li Wen

TL;DR
This paper introduces new evaluation metrics and methods for video object detection from robotic perspectives, emphasizing temporal continuity and stability, and extends detection methods to single object tracking with practical improvements.
Contribution
It proposes non-reference assessments for temporal performance, develops an online tracklet refinement technique, and extends VID methods to SOT tasks for robotic applications.
Findings
Temporal evaluations complement static AP in VID.
Online tracklet refinement improves detection stability.
Extended VID methods to SOT with competitive results.
Abstract
Video object detection (VID) has been vigorously studied for years but almost all literature adopts a static accuracy-based evaluation, i.e., average precision (AP). From a robotic perspective, the importance of recall continuity and localization stability is equal to that of accuracy, but the AP is insufficient to reflect detectors' performance across time. In this paper, non-reference assessments are proposed for continuity and stability based on object tracklets. These temporal evaluations can serve as supplements to static AP. Further, we develop an online tracklet refinement for improving detectors' temporal performance through short tracklet suppression, fragment filling, and temporal location fusion. In addition, we propose a small-overlap suppression to extend VID methods to single object tracking (SOT) task so that a flexible SOT-by-detection framework is then formed.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
