TL;DR
This paper introduces a long-term face tracking system for crowded scenes that combines face detection, recognition, and a novel rank-based verification to improve tracking duration and accuracy in challenging conditions.
Contribution
The paper presents a new long-term multi-face tracking architecture using rank-based face verification and a correction module, enhancing robustness and track length in crowded scenarios.
Findings
Achieves up to 50% longer tracks than existing deep learning trackers.
Validates the effectiveness of each module through experiments.
Introduces novel metrics for evaluating long-term tracking.
Abstract
Most current multi-object trackers focus on short-term tracking, and are based on deep and complex systems that often cannot operate in real-time, making them impractical for video-surveillance. In this paper we present a long-term, multi-face tracking architecture conceived for working in crowded contexts where faces are often the only visible part of a person. Our system benefits from advances in the fields of face detection and face recognition to achieve long-term tracking, and is particularly unconstrained to the motion and occlusions of people. It follows a tracking-by-detection approach, combining a fast short-term visual tracker with a novel online tracklet reconnection strategy grounded on rank-based face verification. The proposed rank-based constraint favours higher inter-class distance among tracklets, and reduces the propagation of errors due to wrong reconnections.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
