Deep Heterogeneous Hashing for Face Video Retrieval
Shishi Qiao, Ruiping Wang, Shiguang Shan, and Xilin Chen

TL;DR
This paper introduces a deep heterogeneous hashing method that effectively unifies face image and video representations on a common Hamming space, improving face video retrieval performance by leveraging Riemannian manifold modeling and end-to-end learning.
Contribution
It proposes an end-to-end deep hashing framework that integrates Riemannian kernel mapping and structured matrix backpropagation for face video retrieval.
Findings
Achieves competitive performance on challenging datasets.
Effectively models face videos using covariance matrices on Riemannian manifolds.
Unifies heterogeneous face representations into a common Hamming space.
Abstract
Retrieving videos of a particular person with face image as a query via hashing technique has many important applications. While face images are typically represented as vectors in Euclidean space, characterizing face videos with some robust set modeling techniques (e.g. covariance matrices as exploited in this study, which reside on Riemannian manifold), has recently shown appealing advantages. This hence results in a thorny heterogeneous spaces matching problem. Moreover, hashing with handcrafted features as done in many existing works is clearly inadequate to achieve desirable performance for this task. To address such problems, we present an end-to-end Deep Heterogeneous Hashing (DHH) method that integrates three stages including image feature learning, video modeling, and heterogeneous hashing in a single framework, to learn unified binary codes for both face images and videos. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
