MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments
By: Zhiyu Huang, Zewei Zhou, Tianhui Cai and more
Potential Business Impact:
Makes self-driving cars predict and plan better.
Modeling realistic and interactive multi-agent behavior is critical to autonomous driving and traffic simulation. However, existing diffusion and autoregressive approaches are limited by iterative sampling, sequential decoding, or task-specific designs, which hinder efficiency and reuse. We propose Masked Denoising Generation (MDG), a unified generative framework that reformulates multi-agent behavior modeling as the reconstruction of independently noised spatiotemporal tensors. Instead of relying on diffusion time steps or discrete tokenization, MDG applies continuous, per-agent and per-timestep noise masks that enable localized denoising and controllable trajectory generation in a single or few forward passes. This mask-driven formulation generalizes across open-loop prediction, closed-loop simulation, motion planning, and conditional generation within one model. Trained on large-scale real-world driving datasets, MDG achieves competitive closed-loop performance on the Waymo Sim Agents and nuPlan Planning benchmarks, while providing efficient, consistent, and controllable open-loop multi-agent trajectory generation. These results position MDG as a simple yet versatile paradigm for multi-agent behavior modeling.
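To make the idea of continuous, per-agent and per-timestep noise masks concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes trajectories are stored as an array of shape (agents, timesteps, features) and that noising is a simple linear blend between the clean value and Gaussian noise. The abstract does not specify the noising rule, so the blend, the function name `apply_noise_mask`, and the example mask values are all hypothetical.

```python
import numpy as np


def apply_noise_mask(traj, mask, rng):
    """Blend clean trajectories with Gaussian noise, element by element.

    traj: array of shape (num_agents, num_steps, dim), clean states.
    mask: array of shape (num_agents, num_steps) with values in [0, 1];
          0 keeps the clean value, 1 replaces it with pure noise.
    The linear blend is an assumption made for illustration only.
    """
    noise = rng.standard_normal(traj.shape)
    m = mask[..., None]  # broadcast the mask over the feature dimension
    return (1.0 - m) * traj + m * noise


rng = np.random.default_rng(0)
num_agents, num_steps, dim = 3, 5, 2
traj = rng.standard_normal((num_agents, num_steps, dim))

# Hypothetical mask: each (agent, timestep) cell gets its own noise level.
mask = np.zeros((num_agents, num_steps))
mask[0, 2:] = 1.0  # agent 0: history kept clean, future fully noised
mask[1, :] = 0.5   # agent 1: partially noised at every timestep
# agent 2 stays at 0.0 everywhere and acts as clean conditioning context

noised = apply_noise_mask(traj, mask, rng)
```

Under this reading, a denoising model would receive `noised` together with `mask` and reconstruct the clean tensor. Choosing which cells are noised is what would let one model cover different tasks: noising only future timesteps corresponds to prediction, while leaving selected agents clean corresponds to conditional generation.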
Similar Papers
Masked Diffusion Models are Secretly Learned-Order Autoregressive Models
Machine Learning (CS)
Teaches computers to create ordered text better.
Masked Generative Policy for Robotic Control
Robotics
Robots learn to do tasks faster and better.
Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies
Machine Learning (CS)
Teaches computers to write better sentences.