Scene Coordinate Reconstruction Priors
By: Wenjing Bian , Axel Barroso-Laguna , Tommaso Cavallari and more
Potential Business Impact:
Makes 3D pictures more real and accurate.
Scene coordinate regression (SCR) models have proven to be powerful implicit scene representations for 3D vision, enabling visual relocalization and structure-from-motion. SCR models are trained specifically for one scene. If training images imply insufficient multi-view constraints SCR models degenerate. We present a probabilistic reinterpretation of training SCR models, which allows us to infuse high-level reconstruction priors. We investigate multiple such priors, ranging from simple priors over the distribution of reconstructed depth values to learned priors over plausible scene coordinate configurations. For the latter, we train a 3D point cloud diffusion model on a large corpus of indoor scans. Our priors push predicted 3D scene points towards plausible geometry at each training step to increase their likelihood. On three indoor datasets our priors help learning better scene representations, resulting in more coherent scene point clouds, higher registration rates and better camera poses, with a positive effect on down-stream tasks such as novel view synthesis and camera relocalization.
Similar Papers
ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training
CV and Pattern Recognition
Helps cameras find their place in new places.
A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios
CV and Pattern Recognition
Helps robots know where they are better.
Statistical Confidence Rescoring for Robust 3D Scene Graph Generation from Multi-View Images
CV and Pattern Recognition
Helps computers understand 3D scenes from pictures.