Weakly Supervised Face Naming with Symmetry-Enhanced Contrastive Loss
Tingyu Qu, Tinne Tuytelaars, Marie-Francine Moens

TL;DR
This paper introduces SECLA and SECLA-B, novel weakly supervised models for face-name alignment that leverage symmetry-enhanced contrastive learning and a two-stage easy-to-hard training process, achieving state-of-the-art results.
Contribution
The paper proposes SECLA and SECLA-B, innovative contrastive learning models with a two-stage training framework for improved weakly supervised face-name alignment.
Findings
Achieved state-of-the-art results on Labeled Faces in the Wild and Celebrity Together datasets.
Demonstrated effectiveness of symmetry-enhanced contrastive loss in weakly supervised settings.
Showed that learning from easy to hard cases improves alignment performance.
Abstract
We revisit the weakly supervised cross-modal face-name alignment task; that is, given an image and a caption, we label the faces in the image with the names occurring in the caption. Whereas past approaches have learned the latent alignment between names and faces by uncertainty reasoning over a set of images and their respective captions, in this paper, we rely on appropriate loss functions to learn the alignments in a neural network setting and propose SECLA and SECLA-B. SECLA is a Symmetry-Enhanced Contrastive Learning-based Alignment model that can effectively maximize the similarity scores between corresponding faces and names in a weakly supervised fashion. A variation of the model, SECLA-B, learns to align names and faces as humans do, that is, learning from easy to hard cases to further increase the performance of SECLA. More specifically, SECLA-B applies a two-stage learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Weakly Supervised Face Naming with Symmetry-Enhanced Contrastive Loss· youtube
Taxonomy
TopicsFace recognition and analysis · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis
MethodsALIGN
