Tracking Persons-of-Interest via Unsupervised Representation Adaptation
Shun Zhang, Jia-Bin Huang, Jongwoo Lim, Yihong Gong, Jinjun Wang,, Narendra Ahuja, Ming-Hsuan Yang

TL;DR
This paper introduces a method for multi-face tracking in unconstrained videos by adapting CNN-based face representations to specific videos, improving discriminability and tracking accuracy across large appearance variations.
Contribution
It proposes an unsupervised, video-specific face representation learning approach that adapts pre-trained CNNs using contextual constraints and triplet loss for improved multi-face tracking.
Findings
Significant performance improvement over existing methods
Effective adaptation of CNNs to specific videos using generated training samples
Hierarchical clustering successfully links tracklets across shots
Abstract
Multi-face tracking in unconstrained videos is a challenging problem as faces of one person often appear drastically different in multiple shots due to significant variations in scale, pose, expression, illumination, and make-up. Existing multi-target tracking methods often use low-level features which are not sufficiently discriminative for identifying faces with such large appearance variations. In this paper, we tackle this problem by learning discriminative, video-specific face representations using convolutional neural networks (CNNs). Unlike existing CNN-based approaches which are only trained on large-scale face image datasets offline, we use the contextual constraints to generate a large number of training samples for a given video, and further adapt the pre-trained face CNN to specific videos using discovered training samples. Using these training samples, we optimize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Video Surveillance and Tracking Methods · Face and Expression Recognition
