Canonical Face Embeddings
David McNeely-White, Ben Sattelberg, Nathaniel Blanchard, Ross, Beveridge

TL;DR
This paper shows that different CNN-based face verification models learn nearly equivalent embeddings up to rotation, indicating a shared underlying representation despite variations in training and architecture.
Contribution
The study demonstrates that face verification CNNs produce embeddings that are related by simple transformations, revealing a common learned representation across models.
Findings
Embeddings from different CNNs can be aligned with rotation or linear transformations.
Performance drops minimally when aligning embeddings with transformations.
Models learn a shared face representation despite architectural differences.
Abstract
We present evidence that many common convolutional neural networks (CNNs) trained for face verification learn functions that are nearly equivalent under rotation. More specifically, we demonstrate that one face verification model's embeddings (i.e. last-layer activations) can be compared directly to another model's embeddings after only a rotation or linear transformation, with little performance penalty. This finding is demonstrated using IJB-C 1:1 verification across the combinations of ten modern off-the-shelf CNN-based face verification models which vary in training dataset, CNN architecture, method of angular loss calculation, or some combination of the 3. These networks achieve a mean true accept rate of 0.96 at a false accept rate of 0.01. When instead evaluating embeddings generated from two CNNs, where one CNN's embeddings are mapped with a linear transformation, the mean true…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
