Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation