Latent-OFER: Detect, Mask, and Reconstruct with Latent Vectors for Occluded Facial Expression Recognition
Isack Lee, Eungi Lee, Seok Bong Yoo

TL;DR
Latent-OFER introduces a novel approach combining occlusion detection, face restoration, and expression recognition using latent vectors and transformer-based models to improve occluded facial expression recognition in real-world scenarios.
Contribution
The paper proposes a new latent vector-based framework that detects occlusions, reconstructs occluded facial regions, and recognizes expressions without requiring fully annotated occlusion data.
Findings
Outperforms state-of-the-art occluded FER methods on multiple datasets.
Effectively detects and restores occluded facial regions.
Maintains high recognition accuracy even with unseen occlusions.
Abstract
Most research on facial expression recognition (FER) is conducted in highly controlled environments, but its performance is often unacceptable when applied to real-world situations. This is because when unexpected objects occlude the face, the FER network faces difficulties extracting facial features and accurately predicting facial expressions. Therefore, occluded FER (OFER) is a challenging problem. Previous studies on occlusion-aware FER have typically required fully annotated facial images for training. However, collecting facial images with various occlusions and expression annotations is time-consuming and expensive. Latent-OFER, the proposed method, can detect occlusions, restore occluded parts of the face as if they were unoccluded, and recognize them, improving FER accuracy. This approach involves three steps: First, the vision transformer (ViT)-based occlusion patch detector…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Emotion and Mood Recognition · Advanced Computing and Algorithms
MethodsAttention Is All You Need · Linear Layer · Softmax · Residual Connection · Layer Normalization · Multi-Head Attention · Dense Connections · Vision Transformer
