More comprehensive facial inversion for more effective expression   recognition

Jiawei Mao; Guangyi Zhao; Yuanqi Chang; Xuesong Yin; Xiaogang Peng,; Rui Xu

arXiv:2211.13564·cs.CV·January 4, 2023

More comprehensive facial inversion for more effective expression recognition

Jiawei Mao, Guangyi Zhao, Yuanqi Chang, Xuesong Yin, Xiaogang Peng,, Rui Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel generative approach using image inversion and a transformer-based model to improve facial expression recognition by extracting richer facial features.

Contribution

It proposes a new inversion-based generative method with a transformer architecture and feature modulation for enhanced FER performance.

Findings

01

Achieves state-of-the-art facial inversion results on FFHQ and CelebA-HQ.

02

Attains competitive FER results on RAF-DB, SFEW, and AffectNet.

03

Demonstrates the effectiveness of the proposed ASIT in feature extraction.

Abstract

Facial expression recognition (FER) plays a significant role in the ubiquitous application of computer vision. We revisit this problem with a new perspective on whether it can acquire useful representations that improve FER performance in the image generation process, and propose a novel generative method based on the image inversion mechanism for the FER task, termed Inversion FER (IFER). Particularly, we devise a novel Adversarial Style Inversion Transformer (ASIT) towards IFER to comprehensively extract features of generated facial images. In addition, ASIT is equipped with an image inversion discriminator that measures the cosine similarity of semantic features between source and generated images, constrained by a distribution alignment loss. Finally, we introduce a feature modulation module to fuse the structural code and latent codes from ASIT for the subsequent FER work. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

talented-q/ifer-master
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Emotion and Mood Recognition · Face and Expression Recognition

MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Adam · Softmax · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings