Score: 0

Systematic Benchmarking of SUMO Against Data-Driven Traffic Simulators

Published: December 20, 2025 | arXiv ID: 2512.18537v1

By: Erdao Liang

This paper presents a systematic benchmarking of the model-based microscopic traffic simulator SUMO against state-of-the-art data-driven traffic simulators using large-scale real-world datasets. Using the Waymo Open Motion Dataset (WOMD) and the Waymo Open Sim Agents Challenge (WOSAC), we evaluate SUMO under both short-horizon (8s) and long-horizon (60s) closed-loop simulation settings. To enable scalable evaluation, we develop Waymo2SUMO, an automated pipeline that converts WOMD scenarios into SUMO simulations. On the WOSAC benchmark, SUMO achieves a realism meta metric of 0.653 while requiring fewer than 100 tunable parameters. Extended rollouts show that SUMO maintains low collision and offroad rates and exhibits stronger long-horizon stability than representative data-driven simulators. These results highlight complementary strengths of model-based and data-driven approaches for autonomous driving simulation and benchmarking.

Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving

Robotics

Tests if AI traffic simulators are good.

3 Aug 2025 3

89%

Can the Waymo Open Motion Dataset Support Realistic Behavioral Modeling? A Validation Study with Naturalistic Trajectories

Robotics

Tests show self-driving car data is not realistic.

3 Sep 2025 0

88%

AgentSUMO: An Agentic Framework for Interactive Simulation Scenario Generation in SUMO via Large Language Models

Human-Computer Interaction

Lets city planners test traffic ideas easily.

10 Nov 2025 0

View PDF Login to Bookmark

Systematic Benchmarking of SUMO Against Data-Driven Traffic Simulators

Technical Abstract

Beyond Simulation: Benchmarking World Models for Planning and Causality in Autonomous Driving

Can the Waymo Open Motion Dataset Support Realistic Behavioral Modeling? A Validation Study with Naturalistic Trajectories

AgentSUMO: An Agentic Framework for Interactive Simulation Scenario Generation in SUMO via Large Language Models