TL;DR
This paper presents a self-supervised learning framework for multi-object tracking of passengers and baggage at airport security checkpoints using overhead cameras, enhancing detection accuracy and multi-view tracking efficiency.
Contribution
The novel SSL-based detection and multi-view association approach improves tracking accuracy without increasing inference time in security checkpoint scenarios.
Findings
Detection accuracy improved by up to 42%.
Multi-object tracking accuracy reached 89%.
Inference time remained under 15 ms.
Abstract
We introduce a novel framework to track multiple objects in overhead camera videos for airport checkpoint security scenarios where targets correspond to passengers and their baggage items. We propose a Self-Supervised Learning (SSL) technique to provide the model information about instance segmentation uncertainty from overhead images. Our SSL approach improves object detection by employing a test-time data augmentation and a regression-based, rotation-invariant pseudo-label refinement technique. Our pseudo-label generation method provides multiple geometrically-transformed images as inputs to a Convolutional Neural Network (CNN), regresses the augmented detections generated by the network to reduce localization errors, and then clusters them using the mean-shift algorithm. The self-supervised detector model is used in a single-camera tracking algorithm to generate temporal identifiers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsEmirates Airlines Office in Dubai
