Score: 1

Exposing the Copycat Problem of Imitation-based Planner: A Novel Closed-Loop Simulator, Causal Benchmark and Joint IL-RL Baseline

Published: April 20, 2025 | arXiv ID: 2504.14709v1

By: Hui Zhou, Shaoshuai Shi, Hongsheng Li

Potential Business Impact:

Teaches self-driving cars to learn better.

Business Areas:

Simulation Software

Machine learning (ML)-based planners have recently gained significant attention. They offer advantages over traditional optimization-based planning algorithms. These advantages include fewer manually selected parameters and faster development. Within ML-based planning, imitation learning (IL) is a common algorithm. It primarily learns driving policies directly from supervised trajectory data. While IL has demonstrated strong performance on many open-loop benchmarks, it remains challenging to determine if the learned policy truly understands fundamental driving principles, rather than simply extrapolating from the ego-vehicle's initial state. Several studies have identified this limitation and proposed algorithms to address it. However, these methods often use original datasets for evaluation. In these datasets, future trajectories are heavily dependent on initial conditions. Furthermore, IL often overfits to the most common scenarios. It struggles to generalize to rare or unseen situations. To address these challenges, this work proposes: 1) a novel closed-loop simulator supporting both imitation and reinforcement learning, 2) a causal benchmark derived from the Waymo Open Dataset to rigorously assess the impact of the copycat problem, and 3) a novel framework integrating imitation learning and reinforcement learning to overcome the limitations of purely imitative approaches. The code for this work will be released soon.

Sequence of Expert: Boosting Imitation Planners for Autonomous Driving through Temporal Alternation

Robotics

Makes self-driving cars safer by fixing errors.

15 Dec 2025 2

88%

CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving

CV and Pattern Recognition

Makes self-driving cars safer and better drivers.

14 Oct 2025 1

88%

A Simple Approach to Constraint-Aware Imitation Learning with Application to Autonomous Racing

Machine Learning (CS)

Teaches robots to drive safely and fast.

10 Mar 2025 0

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

8 pages

Exposing the Copycat Problem of Imitation-based Planner: A Novel Closed-Loop Simulator, Causal Benchmark and Joint IL-RL Baseline

Teaches self-driving cars to learn better.

Technical Abstract

Sequence of Expert: Boosting Imitation Planners for Autonomous Driving through Temporal Alternation

CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving

A Simple Approach to Constraint-Aware Imitation Learning with Application to Autonomous Racing