Improving Optical Flow and Stereo Depth Estimation by Leveraging Uncertainty-Based Learning Difficulties
By: Jisoo Jeong , Hong Cai , Jamie Menjay Lin and more
Potential Business Impact:
Helps computers see depth and motion better.
Conventional training for optical flow and stereo depth models typically employs a uniform loss function across all pixels. However, this one-size-fits-all approach often overlooks the significant variations in learning difficulty among individual pixels and contextual regions. This paper investigates the uncertainty-based confidence maps which capture these spatially varying learning difficulties and introduces tailored solutions to address them. We first present the Difficulty Balancing (DB) loss, which utilizes an error-based confidence measure to encourage the network to focus more on challenging pixels and regions. Moreover, we identify that some difficult pixels and regions are affected by occlusions, resulting from the inherently ill-posed matching problem in the absence of real correspondences. To address this, we propose the Occlusion Avoiding (OA) loss, designed to guide the network into cycle consistency-based confident regions, where feature matching is more reliable. By combining the DB and OA losses, we effectively manage various types of challenging pixels and regions during training. Experiments on both optical flow and stereo depth tasks consistently demonstrate significant performance improvements when applying our proposed combination of the DB and OA losses.
Similar Papers
Integrating Disparity Confidence Estimation into Relative Depth Prior-Guided Unsupervised Stereo Matching
CV and Pattern Recognition
Helps robots see depth without needing labeled training data.
Revisiting Gradient-based Uncertainty for Monocular Depth Estimation
CV and Pattern Recognition
Helps computers know when they're wrong about depth.
Uncertainty-Aware Data-Based Method for Fast and Reliable Shape Optimization
Machine Learning (CS)
Makes airplane wings better by predicting mistakes.