Score: 0

Gaussian Pixel Codec Avatars: A Hybrid Representation for Efficient Rendering

Published: December 17, 2025 | arXiv ID: 2512.15711v1

By: Divam Gupta , Anuj Pahuja , Nemanja Bartolovic and more

Potential Business Impact:

Makes realistic talking heads for phones.

Business Areas:
Image Recognition Data and Analytics, Software

We present Gaussian Pixel Codec Avatars (GPiCA), photorealistic head avatars that can be generated from multi-view images and efficiently rendered on mobile devices. GPiCA utilizes a unique hybrid representation that combines a triangle mesh and anisotropic 3D Gaussians. This combination maximizes memory and rendering efficiency while maintaining a photorealistic appearance. The triangle mesh is highly efficient in representing surface areas like facial skin, while the 3D Gaussians effectively handle non-surface areas such as hair and beard. To this end, we develop a unified differentiable rendering pipeline that treats the mesh as a semi-transparent layer within the volumetric rendering paradigm of 3D Gaussian Splatting. We train neural networks to decode a facial expression code into three components: a 3D face mesh, an RGBA texture, and a set of 3D Gaussians. These components are rendered simultaneously in a unified rendering engine. The networks are trained using multi-view image supervision. Our results demonstrate that GPiCA achieves the realism of purely Gaussian-based avatars while matching the rendering performance of mesh-based avatars.

Page Count
10 pages

Category
Computer Science:
CV and Pattern Recognition