AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting
By: Aymen Mir, Jian Wang, Riza Alp Guler and more
Potential Business Impact:
Makes animated people look real in 3D videos.
We present a novel framework for animating humans in 3D scenes using 3D Gaussian Splatting (3DGS), a neural scene representation that has recently achieved state-of-the-art photorealistic results for novel-view synthesis but remains under-explored for human-scene animation and interaction. Unlike existing animation pipelines that use meshes or point clouds as the underlying 3D representation, our approach uses 3DGS as the 3D representation for animating humans in scenes. By representing both humans and scenes as Gaussians, our approach allows for geometry-consistent free-viewpoint rendering of humans interacting with 3D scenes. Our key insight is that rendering can be decoupled from motion synthesis, so each sub-problem can be addressed independently without the need for paired human-scene data. Central to our method is a Gaussian-aligned motion module that synthesizes motion without explicit scene geometry, using opacity-based cues and projected Gaussian structures to guide human placement and pose alignment. To ensure natural interactions, we further propose a human-scene Gaussian refinement optimization that enforces realistic contact and navigation. We evaluate our approach on scenes from ScanNet++ and the SuperSplat library, and on avatars reconstructed from sparse and dense multi-view human capture. Finally, we demonstrate that our framework enables novel applications such as geometry-consistent free-viewpoint rendering of edited monocular RGB videos with new animated humans, showcasing the unique advantage of 3DGS for monocular video-based human animation.
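The abstract does not specify how "opacity-based cues and projected Gaussian structures" are computed. As a rough illustration only, and not the authors' method, the sketch below shows one way such a cue could be formed: Gaussian centers are projected onto the ground plane and their opacities are accumulated per grid cell, so that cells with high accumulated opacity can be treated as obstacles when placing a human. The function name `opacity_occupancy_grid`, the z-up coordinate convention, the height band, the cell size, and the threshold are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def opacity_occupancy_grid(
    centers: Sequence[Tuple[float, float, float]],
    opacities: Sequence[float],
    cell_size: float = 0.25,   # assumed cell edge length in scene units
    grid_dim: int = 8,         # assumed square grid of grid_dim x grid_dim cells
    min_height: float = 0.1,   # assumed: ignore Gaussians at floor level
    max_height: float = 2.0,   # assumed: ignore Gaussians above head height
    threshold: float = 0.5,    # assumed accumulated-opacity cutoff
) -> List[List[bool]]:
    """Hypothetical top-down occupancy map built from Gaussian centers.

    Each center (x, y, z) is assumed to be z-up. Centers inside the height
    band are projected onto the (x, y) plane and their opacity is added to
    the cell they fall in. A cell counts as occupied when its accumulated
    opacity reaches the threshold.
    """
    accum = [[0.0] * grid_dim for _ in range(grid_dim)]
    for (x, y, z), alpha in zip(centers, opacities):
        if not (min_height <= z <= max_height):
            continue  # outside the band a walking human would collide with
        col = int(x // cell_size)
        row = int(y // cell_size)
        if 0 <= row < grid_dim and 0 <= col < grid_dim:
            accum[row][col] += alpha
    return [[value >= threshold for value in row] for row in accum]


# Toy scene: two overlapping Gaussians form an obstacle, one lies on the
# floor, and one is too faint to count on its own.
centers = [(0.3, 0.3, 1.0), (0.3, 0.3, 1.2), (1.1, 0.2, 0.02), (1.6, 1.6, 0.5)]
opacities = [0.4, 0.3, 0.9, 0.2]
grid = opacity_occupancy_grid(centers, opacities)
occupied = [(r, c) for r, row in enumerate(grid) for c, v in enumerate(row) if v]
print(occupied)
```

In this toy scene only the cell containing the two overlapping Gaussians is marked occupied. The floor-level Gaussian is filtered out by the height band and the faint one stays below the threshold. A real pipeline would also need to account for each Gaussian's spatial extent (scale and rotation), which this sketch ignores.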
Similar Papers
2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting
CV and Pattern Recognition
Creates lifelike animated people from videos.
HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars
CV and Pattern Recognition
Makes digital faces look more real and move naturally.
HGC-Avatar: Hierarchical Gaussian Compression for Streamable Dynamic 3D Avatars
CV and Pattern Recognition
Makes 3D avatars look real and move smoothly.