Latent-OFER: Detect, Mask, and Reconstruct with Latent Vectors for   Occluded Facial Expression Recognition

Isack Lee; Eungi Lee; Seok Bong Yoo

arXiv:2307.11404·cs.CV·July 24, 2023

Latent-OFER: Detect, Mask, and Reconstruct with Latent Vectors for Occluded Facial Expression Recognition

Isack Lee, Eungi Lee, Seok Bong Yoo

PDF

Open Access 1 Repo

TL;DR

Latent-OFER introduces a novel approach combining occlusion detection, face restoration, and expression recognition using latent vectors and transformer-based models to improve occluded facial expression recognition in real-world scenarios.

Contribution

The paper proposes a new latent vector-based framework that detects occlusions, reconstructs occluded facial regions, and recognizes expressions without requiring fully annotated occlusion data.

Findings

01

Outperforms state-of-the-art occluded FER methods on multiple datasets.

02

Effectively detects and restores occluded facial regions.

03

Maintains high recognition accuracy even with unseen occlusions.

Abstract

Most research on facial expression recognition (FER) is conducted in highly controlled environments, but its performance is often unacceptable when applied to real-world situations. This is because when unexpected objects occlude the face, the FER network faces difficulties extracting facial features and accurately predicting facial expressions. Therefore, occluded FER (OFER) is a challenging problem. Previous studies on occlusion-aware FER have typically required fully annotated facial images for training. However, collecting facial images with various occlusions and expression annotations is time-consuming and expensive. Latent-OFER, the proposed method, can detect occlusions, restore occluded parts of the face as if they were unoccluded, and recognize them, improving FER accuracy. This approach involves three steps: First, the vision transformer (ViT)-based occlusion patch detector…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leeisack/latent-ofer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Emotion and Mood Recognition · Advanced Computing and Algorithms

MethodsAttention Is All You Need · Linear Layer · Softmax · Residual Connection · Layer Normalization · Multi-Head Attention · Dense Connections · Vision Transformer