Score: 0

MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Published: December 15, 2025 | arXiv ID: 2512.13636v1

By: Haoyu Fu , Diankun Zhang , Zongchuang Zhao and more

Potential Business Impact:

Teaches self-driving cars to learn by trying.

Business Areas:

Autonomous Vehicles Transportation

Current Vision-Language-Action (VLA) paradigms in autonomous driving primarily rely on Imitation Learning (IL), which introduces inherent challenges such as distribution shift and causal confusion. Online Reinforcement Learning offers a promising pathway to address these issues through trial-and-error learning. However, applying online reinforcement learning to VLA models in autonomous driving is hindered by inefficient exploration in continuous action spaces. To overcome this limitation, we propose MindDrive, a VLA framework comprising a large language model (LLM) with two distinct sets of LoRA parameters. The one LLM serves as a Decision Expert for scenario reasoning and driving decision-making, while the other acts as an Action Expert that dynamically maps linguistic decisions into feasible trajectories. By feeding trajectory-level rewards back into the reasoning space, MindDrive enables trial-and-error learning over a finite set of discrete linguistic driving decisions, instead of operating directly in a continuous action space. This approach effectively balances optimal decision-making in complex scenarios, human-like driving behavior, and efficient exploration in online reinforcement learning. MindDrive achieves strong closed-loop performance on the challenging Bench2Drive benchmark, with a Driving Score (DS) of 78.04 and a Success Rate (SR) of 55.09%. To the best of our knowledge, this is the first work to demonstrate the effectiveness of online reinforcement learning for the VLA model in autonomous driving.

DriveMind: A Dual-VLM based Reinforcement Learning Framework for Autonomous Driving

Robotics

Makes self-driving cars safer and smarter.

1 Jun 2025 0

93%

Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving

CV and Pattern Recognition

Helps self-driving cars drive smarter and faster.

25 Nov 2025 1

92%

AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

CV and Pattern Recognition

Helps self-driving cars plan safer, faster trips.

16 Jun 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

16 pages

MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Teaches self-driving cars to learn by trying.

Technical Abstract

DriveMind: A Dual-VLM based Reinforcement Learning Framework for Autonomous Driving

Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving

AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning