Real-World Depth Recovery via Structure Uncertainty Modeling and Inaccurate GT Depth Fitting
By: Delong Suzhang, Meng Yang
Potential Business Impact:
Fixes blurry depth pictures for better 3D understanding.
The low-quality structure in raw depth maps is prevalent in real-world RGB-D datasets, which makes real-world depth recovery a critical task in recent years. However, the lack of paired raw-ground truth (raw-GT) data in the real world poses challenges for generalized depth recovery. Existing methods insufficiently consider the diversity of structure misalignment in raw depth maps, which leads to poor generalization in real-world depth recovery. Notably, random structure misalignments are not limited to raw depth data but also affect GT depth in real-world datasets. In the proposed method, we tackle the generalization problem from both input and output perspectives. For input, we enrich the diversity of structure misalignment in raw depth maps by designing a new raw depth generation pipeline, which helps the network avoid overfitting to a specific condition. Furthermore, a structure uncertainty module is designed to explicitly identify the misaligned structure for input raw depth maps to better generalize in unseen scenarios. Notably the well-trained depth foundation model (DFM) can help the structure uncertainty module estimate the structure uncertainty better. For output, a robust feature alignment module is designed to precisely align with the accurate structure of RGB images avoiding the interference of inaccurate GT depth. Extensive experiments on multiple datasets demonstrate the proposed method achieves competitive accuracy and generalization capabilities across various challenging raw depth maps.
Similar Papers
Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth
CV and Pattern Recognition
Makes 3D pictures clearer and more accurate.
Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors
CV and Pattern Recognition
Helps cars see distance in rain and dark.
Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching
CV and Pattern Recognition
Helps robots see depth without needing labeled training data.