Semi-Siamese Training for Shallow Face Learning
Hang Du, Hailin Shi, Yuchi Liu, Jun Wang, Zhen Lei, Dan Zeng, Tao Mei

TL;DR
This paper introduces Semi-Siamese Training (SST), a novel method designed to improve face recognition models trained on shallow datasets with limited intra-class variation, addressing issues of overfitting and feature collapse.
Contribution
The paper proposes SST, a flexible training approach using Semi-Siamese networks and a gallery queue, effective for shallow face data and compatible with existing architectures.
Findings
Significantly improves shallow face learning performance.
Enhances training stability and reduces overfitting.
Effective also for conventional deep face datasets.
Abstract
Most existing public face datasets, such as MS-Celeb-1M and VGGFace2, provide abundant information in both breadth (large number of IDs) and depth (sufficient number of samples) for training. However, in many real-world scenarios of face recognition, the training dataset is limited in depth, i.e. only two face images are available for each ID. Unlike deep face data, the shallow face data lacks intra-class diversity. As such, it can lead to collapse of feature dimension and consequently the learned network can easily suffer from degeneration and over-fitting in the collapsed dimension. In this paper, we aim to address the problem by introducing a novel training method named Semi-Siamese Training (SST). A pair of Semi-Siamese networks constitute the forward propagation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Video Surveillance and Tracking Methods
