
SCA 2022: Voice2Face: Audio-Driven Facial and Tongue Rig Animations with cVAEs

This is a research presentation from the Eurographics Symposium on Computer Animation (SCA 2022). Authors: Mónica Villanueva Aylagas, Héctor Anadon Leon, Mattias Teye, and Konrad Tollmar.

Download the full research paper. (5.9 MB PDF)

In this paper, we present Voice2Face, a tool that generates facial and tongue animations directly from recorded speech using machine learning.

Our approach consists of two steps: a conditional Variational Autoencoder (cVAE) generates mesh animations from speech, and a separate module maps those animations to rig controller space. Our contributions include an automated method for speech style control, a method to train a model with data from multiple quality levels, and a method for animating the tongue.
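To make the two-step structure concrete, here is a minimal sketch in Python with NumPy. It is not the model from the paper. Everything in it is an assumption made for illustration: the dimensions, the single linear layers with random weights that stand in for trained networks, and the function names. It only shows how data flows through the pipeline: a decoder conditioned on speech features produces mesh offsets, and a separate mapping turns those offsets into rig controller values.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
AUDIO_DIM = 32    # per-frame speech features (the condition)
MESH_DIM = 90     # flattened vertex offsets, e.g. 30 vertices x 3
LATENT_DIM = 8    # latent code of the cVAE
RIG_DIM = 12      # rig controller values


def init_linear(in_dim, out_dim):
    """Random weights standing in for trained parameters."""
    return rng.normal(0.0, 0.1, size=(in_dim, out_dim)), np.zeros(out_dim)


enc_w, enc_b = init_linear(MESH_DIM + AUDIO_DIM, 2 * LATENT_DIM)
dec_w, dec_b = init_linear(LATENT_DIM + AUDIO_DIM, MESH_DIM)
rig_w, rig_b = init_linear(MESH_DIM, RIG_DIM)


def encode(mesh, audio):
    """Training-time encoder: mean and log-variance of q(z | mesh, audio)."""
    h = np.concatenate([mesh, audio], axis=-1) @ enc_w + enc_b
    return h[..., :LATENT_DIM], h[..., LATENT_DIM:]


def reparameterize(mu, log_var):
    """Sample z = mu + sigma * eps so the sample stays differentiable in mu, sigma."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps


def decode(z, audio):
    """Step 1: decoder conditioned on speech features produces mesh offsets."""
    return np.concatenate([z, audio], axis=-1) @ dec_w + dec_b


def mesh_to_rig(mesh):
    """Step 2: separate module mapping mesh offsets to rig controller values."""
    return mesh @ rig_w + rig_b


def animate(audio_frames, z=None):
    """Inference path: speech features -> mesh animation -> rig controllers."""
    if z is None:
        # Assumption: use the prior mean when no latent code is supplied.
        z = np.zeros((audio_frames.shape[0], LATENT_DIM))
    mesh = decode(z, audio_frames)
    return mesh, mesh_to_rig(mesh)


audio = rng.standard_normal((5, AUDIO_DIM))  # five frames of fake features
mesh, rig = animate(audio)
```

In this sketch, `animate` returns one row of mesh offsets and one row of rig values per audio frame. The encoder and `reparameterize` are used only during training of a cVAE; at inference only the decoder and the rig mapping run.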

Unlike previous work, our model generates animations without speaker-dependent characteristics while still allowing speech style control.

We demonstrate through a user study that Voice2Face significantly outperforms a comparative state-of-the-art model, and our quantitative evaluation suggests that our speech style optimization yields more accurate lip closure in speech containing bilabials. Both evaluations also show that our data quality conditioning scheme outperforms both an unconditioned model and a model trained with a smaller high-quality dataset. Finally, the user study shows a preference for animations that include the tongue.
