Blending 3D Geometry and Machine Learning for Multi-View Stereopsis
By: Vibhas Vats , Md. Alimoor Reza , David Crandall and more
Potential Business Impact:
Makes 3D pictures from photos faster.
Traditional multi-view stereo (MVS) methods primarily depend on photometric and geometric consistency constraints. In contrast, modern learning-based algorithms often rely on the plane sweep algorithm to infer 3D geometry, applying explicit geometric consistency (GC) checks only as a post-processing step, with no impact on the learning process itself. In this work, we introduce GC MVSNet plus plus, a novel approach that actively enforces geometric consistency of reference view depth maps across multiple source views (multi view) and at various scales (multi scale) during the learning phase (see Fig. 1). This integrated GC check significantly accelerates the learning process by directly penalizing geometrically inconsistent pixels, effectively halving the number of training iterations compared to other MVS methods. Furthermore, we introduce a densely connected cost regularization network with two distinct block designs simple and feature dense optimized to harness dense feature connections for enhanced regularization. Extensive experiments demonstrate that our approach achieves a new state of the art on the DTU and BlendedMVS datasets and secures second place on the Tanks and Temples benchmark. To our knowledge, GC MVSNet plus plus is the first method to enforce multi-view, multi-scale supervised geometric consistency during learning. Our code is available.
Similar Papers
CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning
CV and Pattern Recognition
Makes 3D pictures from photos better.
Geometry-Consistent 4D Gaussian Splatting for Sparse-Input Dynamic View Synthesis
CV and Pattern Recognition
Creates realistic 3D scenes from few pictures.
MVGSR: Multi-View Consistency Gaussian Splatting for Robust Surface Reconstruction
CV and Pattern Recognition
Makes 3D models from moving pictures accurately.