Generalizable Facial Expression Recognition

Yuhang Zhang; Xiuqi Zheng; Chenyi Liang; Jiani Hu; and Weihong Deng

arXiv:2408.10614·cs.CV·August 21, 2024

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang, Jiani Hu, and Weihong Deng

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel zero-shot FER method leveraging large model features and learned masks, significantly improving generalization to unseen datasets without target domain data.

Contribution

It proposes a new FER pipeline that uses CLIP-based features and mask learning to enhance zero-shot generalization, avoiding fine-tuning on target domains.

Findings

01

Outperforms state-of-the-art FER methods on five datasets

02

Achieves significant improvements in zero-shot generalization

03

Demonstrates robustness across diverse unseen test sets

Abstract

SOTA facial expression recognition (FER) methods fail on test sets that have domain gaps with the train set. Recent domain adaptation FER methods need to acquire labeled or unlabeled samples of target domains to fine-tune the FER model, which might be infeasible in real-world deployment. In this paper, we aim to improve the zero-shot generalization ability of FER methods on different unseen test sets using only one train set. Inspired by how humans first detect faces and then select expression features, we propose a novel FER pipeline to extract expression-related features from any given face images. Our method is based on the generalizable face features extracted by large models like CLIP. However, it is non-trivial to adapt the general features of CLIP for specific tasks like FER. To preserve the generalization ability of CLIP and the high precision of the FER model, we design a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zyh-uaiaaaa/generalizable-fer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition

MethodsContrastive Language-Image Pre-training