Towards a General Model of Knowledge for Facial Analysis by Multi-Source   Transfer Learning

Valentin Vielzeuf; Alexis Lechervy; St\'ephane Pateux; Fr\'ed\'eric; Jurie

arXiv:1911.03222·cs.LG·November 11, 2019·5 cites

Towards a General Model of Knowledge for Facial Analysis by Multi-Source Transfer Learning

Valentin Vielzeuf, Alexis Lechervy, St\'ephane Pateux, Fr\'ed\'eric, Jurie

PDF

Open Access

TL;DR

This paper introduces a multi-source transfer learning framework for facial analysis that combines models into a common embedding and distills this into a lightweight, versatile student model that outperforms its teachers on various tasks.

Contribution

It presents a novel two-step approach using auto-encoder fusion and model distillation to create a compact, generalizable facial analysis model from multiple sources.

Findings

01

The student model outperforms its teacher on new facial analysis tasks.

02

The approach achieves state-of-the-art results on 15 facial analysis tasks.

03

The student model has 75 times fewer parameters than the teacher.

Abstract

This paper proposes a step toward obtaining general models of knowledge for facial analysis, by addressing the question of multi-source transfer learning. More precisely, the proposed approach consists in two successive training steps: the first one consists in applying a combination operator to define a common embedding for the multiple sources materialized by different existing trained models. The proposed operator relies on an auto-encoder, trained on a large dataset, efficient both in terms of compression ratio and transfer learning performance. In a second step we exploit a distillation approach to obtain a lightweight student model mimicking the collection of the fused existing models. This model outperforms its teacher on novel tasks, achieving results on par with state-of-the-art methods on 15 facial analysis tasks (and domains), at an affordable training cost. Moreover, this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Speech and Audio Processing · Domain Adaptation and Few-Shot Learning