VEIGAR: View-consistent Explicit Inpainting and Geometry Alignment for 3D object Removal
By: Pham Khai Nguyen Do , Bao Nguyen Tran , Nam Nguyen and more
Potential Business Impact:
Makes computer images look real from any angle.
Recent advances in Novel View Synthesis (NVS) and 3D generation have significantly improved editing tasks, with a primary emphasis on maintaining cross-view consistency throughout the generative process. Contemporary methods typically address this challenge using a dual-strategy framework: performing consistent 2D inpainting across all views guided by embedded priors either explicitly in pixel space or implicitly in latent space; and conducting 3D reconstruction with additional consistency guidance. Previous strategies, in particular, often require an initial 3D reconstruction phase to establish geometric structure, introducing considerable computational overhead. Even with the added cost, the resulting reconstruction quality often remains suboptimal. In this paper, we present VEIGAR, a computationally efficient framework that outperforms existing methods without relying on an initial reconstruction phase. VEIGAR leverages a lightweight foundation model to reliably align priors explicitly in the pixel space. In addition, we introduce a novel supervision strategy based on scale-invariant depth loss, which removes the need for traditional scale-and-shift operations in monocular depth regularization. Through extensive experimentation, VEIGAR establishes a new state-of-the-art benchmark in reconstruction quality and cross-view consistency, while achieving a threefold reduction in training time compared to the fastest existing method, highlighting its superior balance of efficiency and effectiveness.
Similar Papers
Selfi: Self Improving Reconstruction Engine via 3D Geometric Feature Alignment
CV and Pattern Recognition
Creates realistic 3D scenes from photos.
IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement
CV and Pattern Recognition
Fixes 3D pictures from any angle.
Perspective-aware 3D Gaussian Inpainting with Multi-view Consistency
CV and Pattern Recognition
Fixes 3D pictures so they look real from all sides.