ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions
Cheng Luo, Siyang Song, Weicheng Xie, Micol Spitale, Zongyuan Ge,, Linlin Shen, Hatice Gunes

TL;DR
ReactFace is a novel framework that predicts and generates multiple diverse, synchronized, and appropriate facial reactions in dyadic interactions, addressing the diversity and dependency modeling limitations of prior deterministic approaches.
Contribution
This work introduces ReactFace, a framework that models facial reactions as a distribution, enabling the generation of multiple appropriate reactions synchronized with speaker behavior.
Findings
Successfully generates diverse facial reactions
Achieves high synchronization with speaker behaviors
Produces realistic facial reaction sequences
Abstract
In dyadic interaction, predicting the listener's facial reactions is challenging as different reactions could be appropriate in response to the same speaker's behaviour. Previous approaches predominantly treated this task as an interpolation or fitting problem, emphasizing deterministic outcomes but ignoring the diversity and uncertainty of human facial reactions. Furthermore, these methods often failed to model short-range and long-range dependencies within the interaction context, leading to issues in the synchrony and appropriateness of the generated facial reactions. To address these limitations, this paper reformulates the task as an extrapolation or prediction problem, and proposes an novel framework (called ReactFace) to generate multiple different but appropriate facial reactions from a speaker behaviour rather than merely replicating the corresponding listener facial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Speech and Audio Processing · Emotion and Mood Recognition
