Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
By: Fatih Erdoğan, Merve Rabia Barın, Fatma Güney
Potential Business Impact:
Makes self-driving cars see roads better.
Constructing high-definition (HD) maps from sensory input requires accurately mapping the road elements in image space to the Bird's Eye View (BEV) space. The precision of this mapping directly impacts the quality of the final vectorized HD map. Existing HD mapping approaches outsource the projection to standard mapping techniques, such as attention-based ones. However, these methods struggle with accuracy due to generalization problems, often hallucinating non-existent road elements. Our key idea is to start with a geometric mapping based on camera parameters and adapt it to the scene to extract relevant map information from camera images. To implement this, we propose a novel probabilistic projection mechanism with confidence scores to (i) refine the mapping to better align with the scene and (ii) filter out irrelevant elements that should not influence HD map generation. In addition, we improve temporal processing by using confidence scores to selectively accumulate reliable information over time. Experiments on new splits of the nuScenes and Argoverse2 datasets demonstrate improved performance over state-of-the-art approaches, indicating better generalization. The improvements are particularly pronounced on nuScenes and in the challenging long perception range. Our code and model checkpoints are available at https://github.com/Fatih-Erdogan/mapping-like-skeptic .
Similar Papers
BEVTraj: Map-Free End-to-End Trajectory Prediction in Bird's-Eye View with Deformable Attention and Sparse Goal Proposals
CV and Pattern Recognition
Helps self-driving cars see without maps.
An Initial Study of Bird's-Eye View Generation for Autonomous Vehicles using Cross-View Transformers
CV and Pattern Recognition
Helps self-driving cars see roads from above.
Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking
CV and Pattern Recognition
Helps self-driving cars see better in 3D.