SDTagNet: Leveraging Text-Annotated Navigation Maps for Online HD Map Construction
By: Fabian Immel , Jan-Hendrik Pauls , Richard Fehler and more
Potential Business Impact:
Helps self-driving cars see farther using old maps.
Autonomous vehicles rely on detailed and accurate environmental information to operate safely. High definition (HD) maps offer a promising solution, but their high maintenance cost poses a significant barrier to scalable deployment. This challenge is addressed by online HD map construction methods, which generate local HD maps from live sensor data. However, these methods are inherently limited by the short perception range of onboard sensors. To overcome this limitation and improve general performance, recent approaches have explored the use of standard definition (SD) maps as prior, which are significantly easier to maintain. We propose SDTagNet, the first online HD map construction method that fully utilizes the information of widely available SD maps, like OpenStreetMap, to enhance far range detection accuracy. Our approach introduces two key innovations. First, in contrast to previous work, we incorporate not only polyline SD map data with manually selected classes, but additional semantic information in the form of textual annotations. In this way, we enrich SD vector map tokens with NLP-derived features, eliminating the dependency on predefined specifications or exhaustive class taxonomies. Second, we introduce a point-level SD map encoder together with orthogonal element identifiers to uniformly integrate all types of map elements. Experiments on Argoverse 2 and nuScenes show that this boosts map perception performance by up to +5.9 mAP (+45%) w.r.t. map construction without priors and up to +3.2 mAP (+20%) w.r.t. previous approaches that already use SD map priors. Code is available at https://github.com/immel-f/SDTagNet
Similar Papers
NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction
CV and Pattern Recognition
Makes self-driving cars see roads better.
Inferring Driving Maps by Deep Learning-based Trail Map Extraction
CV and Pattern Recognition
Makes self-driving cars learn roads from any car.
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
Robotics
Helps self-driving cars see better with simple maps.