FastAvatar: Instant 3D Gaussian Splatting for Faces from Single Unconstrained Poses
By: Hao Liang , Zhixuan Ge , Ashish Tiwari and more
Potential Business Impact:
Creates realistic 3D faces from one picture.
We present FastAvatar, a pose-invariant, feed-forward framework that can generate a 3D Gaussian Splatting (3DGS) model from a single face image from an arbitrary pose in near-instant time (<10ms). FastAvatar uses a novel encoder-decoder neural network design to achieve both fast fitting and identity preservation regardless of input pose. First, FastAvatar constructs a 3DGS face ``template'' model from a training dataset of faces with multi-view captures. Second, FastAvatar encodes the input face image into an identity-specific and pose-invariant latent embedding, and decodes this embedding to predict residuals to the structural and appearance parameters of each Gaussian in the template 3DGS model. By only inferring residuals in a feed-forward fashion, model inference is fast and robust. FastAvatar significantly outperforms existing feed-forward face 3DGS methods (e.g., GAGAvatar) in reconstruction quality, and runs 1000x faster than per-face optimization methods (e.g., FlashAvatar, GaussianAvatars and GASP). In addition, FastAvatar's novel latent space design supports real-time identity interpolation and attribute editing which is not possible with any existing feed-forward 3DGS face generation framework. FastAvatar's combination of excellent reconstruction quality and speed expands the scope of 3DGS for photorealistic avatar applications in consumer and interactive systems.
Similar Papers
FastAvatar: Towards Unified Fast High-Fidelity 3D Avatar Reconstruction with Large Gaussian Reconstruction Transformers
CV and Pattern Recognition
Makes 3D avatars from photos in seconds.
Instant Expressive Gaussian Head Avatar via 3D-Aware Expression Distillation
CV and Pattern Recognition
Makes talking faces in 3D, fast and real.
HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars
CV and Pattern Recognition
Makes digital faces look more real and move naturally.