Score: 2

DrivIng: A Large-Scale Multimodal Driving Dataset with Full Digital Twin Integration

Published: January 21, 2026 | arXiv ID: 2601.15260v1

By: Dominik Rößle , Xujun Xie , Adithya Mohan and more

Potential Business Impact:

Makes self-driving cars see better in all conditions.

Business Areas:

Image Recognition Data and Analytics, Software

Perception is a cornerstone of autonomous driving, enabling vehicles to understand their surroundings and make safe, reliable decisions. Developing robust perception algorithms requires large-scale, high-quality datasets that cover diverse driving conditions and support thorough evaluation. Existing datasets often lack a high-fidelity digital twin, limiting systematic testing, edge-case simulation, sensor modification, and sim-to-real evaluations. To address this gap, we present DrivIng, a large-scale multimodal dataset with a complete geo-referenced digital twin of a ~18 km route spanning urban, suburban, and highway segments. Our dataset provides continuous recordings from six RGB cameras, one LiDAR, and high-precision ADMA-based localization, captured across day, dusk, and night. All sequences are annotated at 10 Hz with 3D bounding boxes and track IDs across 12 classes, yielding ~1.2 million annotated instances. Alongside the benefits of a digital twin, DrivIng enables a 1-to-1 transfer of real traffic into simulation, preserving agent interactions while enabling realistic and flexible scenario testing. To support reproducible research and robust validation, we benchmark DrivIng with state-of-the-art perception models and publicly release the dataset, digital twin, HD map, and codebase.

PercepTwin: Modeling High-Fidelity Digital Twins for Sim2Real LiDAR-based Perception for Intelligent Transportation Systems

CV and Pattern Recognition

Creates fake road data to train self-driving cars.

3 Sep 2025 0

90%

Collaborative Perception Datasets for Autonomous Driving: A Review

CV and Pattern Recognition

Helps self-driving cars share what they see.

17 Apr 2025 1

90%

Vision-Based Natural Language Scene Understanding for Autonomous Driving: An Extended Dataset and a New Model for Traffic Scene Description Generation

CV and Pattern Recognition

Lets cars describe what they see in words.

20 Jan 2026 1

View PDF Login to Bookmark

Country of Origin

🇩🇪 Germany

Repos / Data Links

github.com

Page Count

8 pages

DrivIng: A Large-Scale Multimodal Driving Dataset with Full Digital Twin Integration

Makes self-driving cars see better in all conditions.

Technical Abstract

PercepTwin: Modeling High-Fidelity Digital Twins for Sim2Real LiDAR-based Perception for Intelligent Transportation Systems

Collaborative Perception Datasets for Autonomous Driving: A Review

Vision-Based Natural Language Scene Understanding for Autonomous Driving: An Extended Dataset and a New Model for Traffic Scene Description Generation