A Novel FACS-Aligned Anatomical Text Description Paradigm for Fine-Grained Facial Behavior Synthesis
Jiahe Wang, Cong Liang, Xuandong Huang, Yuxin Wang, Xin Yun, Yi Wu, Yanan Chang, Shangfei Wang

TL;DR
This paper introduces a new paradigm for fine-grained facial behavior synthesis using anatomically grounded FACS-based AU descriptions, along with a large dataset, a novel metric, and a baseline framework.
Contribution
It proposes a novel task paradigm, develops a FACS rule-based text processor, creates a large paired dataset, introduces a new semantic consistency metric, and designs a robust baseline framework.
Findings
The paradigm outperforms state-of-the-art methods in challenging scenarios.
The dataset BP4D-AUText contains over 302K samples for training.
The new metric AAAD effectively measures semantic consistency between descriptions and muscle movements.
Abstract
Facial behavior constitutes the primary medium of human nonverbal communication. Existing synthesis methods predominantly follow two paradigms: coarse emotion category labels or one-hot Action Unit (AU) vectors from the Facial Action Coding System (FACS). Neither paradigm reliably renders fine-grained facial behaviors nor resolves anatomically implausible artifacts caused by conflicting AUs. Therefore, we propose a novel task paradigm: anatomically grounded facial behavior synthesis from FACS-based AU descriptions. This paradigm explicitly encodes FACS-defined muscle movement rules, inter-AU interactions, and conflict resolution mechanisms into natural language control signals. To enable systematic research, we develop a dynamic AU text processor, a FACS rule-based module that converts raw AU annotations into anatomically consistent natural language descriptions. Using this processor,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
