Offline Tracking with Object Permanence
Xianzhong Liu, Holger Caesar

TL;DR
This paper introduces an offline tracking model for autonomous driving datasets that effectively handles occluded objects by leveraging object permanence, improving trajectory recovery and enabling more accurate dataset labeling.
Contribution
The proposed model uniquely combines online tracking, re-identification, and track completion modules with map information to recover occluded object trajectories in offline auto labeling.
Findings
Achieves state-of-the-art 3D multi-object tracking performance.
Significantly improves tracking results by recovering occluded objects.
Enhances offline dataset labeling accuracy through occlusion handling.
Abstract
To reduce the expensive labor cost for manual labeling autonomous driving datasets, an alternative is to automatically label the datasets using an offline perception system. However, objects might be temporally occluded. Such occlusion scenarios in the datasets are common yet underexplored in offline auto labeling. In this work, we propose an offline tracking model that focuses on occluded object tracks. It leverages the concept of object permanence which means objects continue to exist even if they are not observed anymore. The model contains three parts: a standard online tracker, a re-identification (Re-ID) module that associates tracklets before and after occlusion, and a track completion module that completes the fragmented tracks. The Re-ID module and the track completion module use the vectorized map as one of the inputs to refine the tracking results with occlusion. The model…
Peer Reviews
Decision·Submitted to ICLR 2024
- This paper aims to solve an essential problem in autonomous driving dataset labelling. - Quantitative and qualitative results show some superiority of the proposed approach over the compared approaches.
- The technical contribution is limited. The proposed approach heavily relies on off-the-shell detectors/tracker, and are inspired from existing approaches a lot (especially for the track completion module), which seems not significant enough as the main contributions by considering the object permanence conception had already been proposed in previous works [1]. - Missing details about the training hyper-parameters for reproduction. - The proposed approach does not show significant improvements
1. The proposed framework is novel, which embeds both the motion and lane map to obtain the final matching matrix, and also fuses the time query embeddings to implement the trajectory regression. 2. This paper evaluates the proposed method under different evaluation setups, which demonstrate the effectiveness of the method more clearly in addressing occlusion situations.
1. Experiments are only performed on the validation split of nuScene dataset, and compared with few SOTA methods, which lack evidence of its effectiveness to some extent. 2. Based on Table1&2, the effect of the motion embedding doesn't seem obvious. Besides, there are no more ablation studies to show the effectiveness of other designs, like the impact of training with augmented GT data instead of tracker outputs, the impact of embedding time query, and so on. 3. The three modules in the propos
The paper shows an elaborated approach to noncausal MOT. The neural models used seem innovative and novel and the attempt to combine tracklet association with a priori knowledge of lane maps in one end-to-end framework is promising.
Unfortunately, the paper is difficult to understand for the reader. Many details are presented in a course to a more detailed manner. After reading the paper, it is not clear what exactly the contribution is. The idea of using object permanence for tracking has been introduced previously (Tokmakov et al.). The three steps of MOT are not new (Zhang, Li, Nevatia, Global Data Association for Multi-Object Tracking Using Network Flows, 2012). The neural models seem novel. However, the claim in the ab
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Vision and Imaging · Autonomous Vehicle Technology and Safety
