Score: 3

FastAvatar: Instant 3D Gaussian Splatting for Faces from Single Unconstrained Poses

Published: August 25, 2025 | arXiv ID: 2508.18389v1

By: Hao Liang , Zhixuan Ge , Ashish Tiwari and more

BigTech Affiliations: Samsung

Potential Business Impact:

Creates realistic 3D faces from one picture.

Business Areas:
Facial Recognition Data and Analytics, Software

We present FastAvatar, a pose-invariant, feed-forward framework that can generate a 3D Gaussian Splatting (3DGS) model from a single face image from an arbitrary pose in near-instant time (<10ms). FastAvatar uses a novel encoder-decoder neural network design to achieve both fast fitting and identity preservation regardless of input pose. First, FastAvatar constructs a 3DGS face ``template'' model from a training dataset of faces with multi-view captures. Second, FastAvatar encodes the input face image into an identity-specific and pose-invariant latent embedding, and decodes this embedding to predict residuals to the structural and appearance parameters of each Gaussian in the template 3DGS model. By only inferring residuals in a feed-forward fashion, model inference is fast and robust. FastAvatar significantly outperforms existing feed-forward face 3DGS methods (e.g., GAGAvatar) in reconstruction quality, and runs 1000x faster than per-face optimization methods (e.g., FlashAvatar, GaussianAvatars and GASP). In addition, FastAvatar's novel latent space design supports real-time identity interpolation and attribute editing which is not possible with any existing feed-forward 3DGS face generation framework. FastAvatar's combination of excellent reconstruction quality and speed expands the scope of 3DGS for photorealistic avatar applications in consumer and interactive systems.

Country of Origin
🇰🇷 🇺🇸 🇮🇳 South Korea, United States, India

Repos / Data Links

Page Count
12 pages

Category
Computer Science:
CV and Pattern Recognition