How to Train Triplet Networks with 100K Identities?
Chong Wang, Xue Zhang, Xipeng Lan

TL;DR
This paper introduces a subspace learning approach combined with batch hard negative mining to effectively train triplet networks on large-scale face recognition datasets with 100K identities, improving performance and robustness.
Contribution
The paper proposes a novel subspace learning method for triplet networks that addresses the challenge of large-scale identity sets in face recognition.
Findings
Significant performance improvement on MS-Celeb-1M dataset.
Effective handling of noisy data and large-scale retrieval.
Achieved state-of-the-art results without external data.
Abstract
Training triplet networks with large-scale data is challenging in face recognition. Due to the number of possible triplets explodes with the number of samples, previous studies adopt the online hard negative mining(OHNM) to handle it. However, as the number of identities becomes extremely large, the training will suffer from bad local minima because effective hard triplets are difficult to be found. To solve the problem, in this paper, we propose training triplet networks with subspace learning, which splits the space of all identities into subspaces consisting of only similar identities. Combined with the batch OHNM, hard triplets can be found much easier. Experiments on the large-scale MS-Celeb-1M challenge with 100K identities demonstrate that the proposed method can largely improve the performance. In addition, to deal with heavy noise and large-scale retrieval, we also make some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Domain Adaptation and Few-Shot Learning · Face and Expression Recognition
