Cross Architecture Distillation for Face Recognition
Weisong Zhao, Xiangyu Zhu, Zhixiang He, Xiao-Yu Zhang, Zhen Lei

TL;DR
This paper proposes a novel cross-architecture knowledge distillation method for face recognition, enabling effective transfer from Transformer teachers to CNN students by aligning spatial features and enhancing distillation-specific learning.
Contribution
It introduces URFM for spatial feature alignment and APT for distillation-specific knowledge management, addressing key challenges in cross-architecture face recognition distillation.
Findings
Outperforms existing methods on face benchmarks
Effective knowledge transfer from Transformers to CNNs
Improved face verification accuracy
Abstract
Transformers have emerged as the superior choice for face recognition tasks, but their insufficient platform acceleration hinders their application on mobile devices. In contrast, Convolutional Neural Networks (CNNs) capitalize on hardware-compatible acceleration libraries. Consequently, it has become indispensable to preserve the distillation efficacy when transferring knowledge from a Transformer-based teacher model to a CNN-based student model, known as Cross-Architecture Knowledge Distillation (CAKD). Despite its potential, the deployment of CAKD in face recognition encounters two challenges: 1) the teacher and student share disparate spatial information for each pixel, obstructing the alignment of feature space, and 2) the teacher network is not trained in the role of a teacher, lacking proficiency in handling distillation-specific knowledge. To surmount these two constraints, 1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Biometric Identification and Security · Face and Expression Recognition
MethodsKnowledge Distillation
