Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering
By: Divam Gupta , Anuj Pahuja , Nemanja Bartolovic and more
Potential Business Impact:
Makes realistic talking heads for phones.
We present Gaussian Pixel Codec Avatars (GPiCA), photorealistic head avatars that can be generated from multi-view images and efficiently rendered on mobile devices. GPiCA utilizes a unique hybrid representation that combines a triangle mesh and anisotropic 3D Gaussians. This combination maximizes memory and rendering efficiency while maintaining a photorealistic appearance. The triangle mesh is highly efficient in representing surface areas like facial skin, while the 3D Gaussians effectively handle non-surface areas such as hair and beard. To this end, we develop a unified differentiable rendering pipeline that treats the mesh as a semi-transparent layer within the volumetric rendering paradigm of 3D Gaussian Splatting. We train neural networks to decode a facial expression code into three components: a 3D face mesh, an RGBA texture, and a set of 3D Gaussians. These components are rendered simultaneously in a unified rendering engine. The networks are trained using multi-view image supervision. Our results demonstrate that GPiCA achieves the realism of purely Gaussian-based avatars while matching the rendering performance of mesh-based avatars.
Similar Papers
HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars
CV and Pattern Recognition
Makes 3D avatars look real and move smoothly.
TeGA: Texture Space Gaussian Avatars for High-Resolution Dynamic Head Modeling
CV and Pattern Recognition
Creates super-real 3D faces that move naturally.
3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations
CV and Pattern Recognition
Creates realistic 3D faces that move and look real.