RoGSplat: Learning Robust Generalizable Human Gaussian Splatting from Sparse Multi-View Images
By: Junjin Xiao , Qing Zhang , Yonewei Nie and more
Potential Business Impact:
Makes 3D models of people from few pictures.
This paper presents RoGSplat, a novel approach for synthesizing high-fidelity novel views of unseen human from sparse multi-view images, while requiring no cumbersome per-subject optimization. Unlike previous methods that typically struggle with sparse views with few overlappings and are less effective in reconstructing complex human geometry, the proposed method enables robust reconstruction in such challenging conditions. Our key idea is to lift SMPL vertices to dense and reliable 3D prior points representing accurate human body geometry, and then regress human Gaussian parameters based on the points. To account for possible misalignment between SMPL model and images, we propose to predict image-aligned 3D prior points by leveraging both pixel-level features and voxel-level features, from which we regress the coarse Gaussians. To enhance the ability to capture high-frequency details, we further render depth maps from the coarse 3D Gaussians to help regress fine-grained pixel-wise Gaussians. Experiments on several benchmark datasets demonstrate that our method outperforms state-of-the-art methods in novel view synthesis and cross-dataset generalization. Our code is available at https://github.com/iSEE-Laboratory/RoGSplat.
Similar Papers
SparSplat: Fast Multi-View Reconstruction with Generalizable 2D Gaussian Splatting
CV and Pattern Recognition
Makes 3D pictures from few photos, super fast.
Physics-Aware Human-Object Rendering from Sparse Views via 3D Gaussian Splatting
Graphics
Makes computer videos show people touching things realistically.
MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting
Graphics
Creates 3D shapes from few pictures.