Unposed 3DGS Reconstruction with Probabilistic Procrustes Mapping
By: Chong Cheng , Zijian Wang , Sicheng Yu and more
Potential Business Impact:
Creates detailed 3D worlds from many photos.
Plain English Summary
Imagine you want to create a realistic 3D model of a real-world place, like a city street, using lots of photos. This new method makes it much easier and faster to build these 3D models, even with hundreds of pictures. This is a big deal because it means we can create incredibly detailed digital twins of our world for things like self-driving cars or virtual reality without needing super powerful computers or lots of manual work.
3D Gaussian Splatting (3DGS) has emerged as a core technique for 3D representation. Its effectiveness largely depends on precise camera poses and accurate point cloud initialization, which are often derived from pretrained Multi-View Stereo (MVS) models. However, in unposed reconstruction task from hundreds of outdoor images, existing MVS models may struggle with memory limits and lose accuracy as the number of input images grows. To address this limitation, we propose a novel unposed 3DGS reconstruction framework that integrates pretrained MVS priors with the probabilistic Procrustes mapping strategy. The method partitions input images into subsets, maps submaps into a global space, and jointly optimizes geometry and poses with 3DGS. Technically, we formulate the mapping of tens of millions of point clouds as a probabilistic Procrustes problem and solve a closed-form alignment. By employing probabilistic coupling along with a soft dustbin mechanism to reject uncertain correspondences, our method globally aligns point clouds and poses within minutes across hundreds of images. Moreover, we propose a joint optimization framework for 3DGS and camera poses. It constructs Gaussians from confidence-aware anchor points and integrates 3DGS differentiable rendering with an analytical Jacobian to jointly refine scene and poses, enabling accurate reconstruction and pose estimation. Experiments on Waymo and KITTI datasets show that our method achieves accurate reconstruction from unposed image sequences, setting a new state of the art for unposed 3DGS reconstruction.
Similar Papers
A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds
CV and Pattern Recognition
Creates 3D worlds faster without needing extra steps.
3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS
CV and Pattern Recognition
Makes 3D pictures look real, even with bad camera data.
Breaking the Vicious Cycle: Coherent 3D Gaussian Splatting from Sparse and Motion-Blurred Views
CV and Pattern Recognition
Makes 3D pictures from few, blurry photos.