Score: 3

RealDriveSim: A Realistic Multi-Modal Multi-Task Synthetic Dataset for Autonomous Driving

Published: June 19, 2025 | arXiv ID: 2506.16319v1

By: Arpit Jadon , Haoran Wang , Phillip Thomas and more

BigTech Affiliations: Huawei

Potential Business Impact:

Creates realistic fake driving scenes for self-driving cars.

Business Areas:
Simulation Software

As perception models continue to develop, the need for large-scale datasets increases. However, data annotation remains far too expensive to effectively scale and meet the demand. Synthetic datasets provide a solution to boost model performance with substantially reduced costs. However, current synthetic datasets remain limited in their scope, realism, and are designed for specific tasks and applications. In this work, we present RealDriveSim, a realistic multi-modal synthetic dataset for autonomous driving that not only supports popular 2D computer vision applications but also their LiDAR counterparts, providing fine-grained annotations for up to 64 classes. We extensively evaluate our dataset for a wide range of applications and domains, demonstrating state-of-the-art results compared to existing synthetic benchmarks. The dataset is publicly available at https://realdrivesim.github.io/.

Country of Origin
🇨🇳 🇨🇭 Switzerland, China

Page Count
8 pages

Category
Computer Science:
CV and Pattern Recognition