MST-KD: Multiple Specialized Teachers Knowledge Distillation for Fair Face Recognition
Eduarda Caldeira, Jaime S. Cardoso, Ana F. Sequeira, Pedro C. Neto

TL;DR
This paper introduces a framework for face recognition that uses multiple specialized, ethnicity-specific teachers to improve accuracy and reduce bias in student networks, demonstrating the importance of ethnicity-specific features.
Contribution
It proposes a novel multiple specialized teacher framework that distills ethnicity-specific knowledge into a student network, enhancing fairness and performance.
Findings
Increased face recognition accuracy across ethnicities.
Reduced bias compared to balanced dataset teachers.
Specialized teachers outperform balanced ones in knowledge distillation.
Abstract
As in school, one teacher to cover all subjects is insufficient to distill equally robust information to a student. Hence, each subject is taught by a highly specialised teacher. Following a similar philosophy, we propose a multiple specialized teacher framework to distill knowledge to a student network. In our approach, directed at face recognition use cases, we train four teachers on one specific ethnicity, leading to four highly specialized and biased teachers. Our strategy learns a project of these four teachers into a common space and distill that information to a student network. Our results highlighted increased performance and reduced bias for all our experiments. In addition, we further show that having biased/specialized teachers is crucial by showing that our approach achieves better results than when knowledge is distilled from four teachers trained on balanced datasets. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
