VividFace: Real-Time and Realistic Facial Expression Shadowing for Humanoid Robots
Peizhen Li, Longbing Cao, Xiao-Ming Wu, and Yang Zhang

TL;DR
VividFace is a system that enables humanoid robots to imitate human facial expressions in real time with high realism, using optimized transfer and inference techniques for lifelike interaction.
Contribution
The paper introduces VividFace, a novel system that achieves real-time, realistic facial expression shadowing for humanoid robots through enhanced transfer models and efficient inference pipelines.
Findings
Produces humanoid facial expressions within 0.05 seconds
Generalizes across diverse facial configurations
Validated through extensive real-world demonstrations
Abstract
Humanoid facial expression shadowing enables robots to realistically imitate human facial expressions in real time, which is critical for lifelike, facially expressive humanoid robots and affective human-robot interaction. Existing progress in humanoid facial expression imitation remains limited, often failing to achieve either real-time performance or realistic expressiveness due to offline video-based inference designs and insufficient ability to capture and transfer subtle expression details. To address these limitations, we present VividFace, a real-time and realistic facial expression shadowing system for humanoid robots. An optimized imitation framework X2CNet++ enhances expressiveness by fine-tuning the human-to-humanoid facial motion transfer module and introducing a feature-adaptation training strategy for better alignment across different image sources. Real-time shadowing is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Face recognition and analysis · Emotion and Mood Recognition
