World Model Robustness via Surprise Recognition
By: Geigh Zollicoffer, Tanush Chopra, Mingkuan Yan, and more
Potential Business Impact:
Keeps AI agents safe from faulty sensors and surprising inputs.
AI systems deployed in the real world must contend with distractions and out-of-distribution (OOD) noise that can destabilize their policies and lead to unsafe behavior. While robust training can reduce sensitivity to some forms of noise, it is infeasible to anticipate all possible OOD conditions. To mitigate this issue, we develop an algorithm that leverages a world model's inherent measure of surprise to reduce the impact of noise in world model-based reinforcement learning agents. We introduce both multi-representation and single-representation rejection sampling, enabling robustness to settings with multiple faulty sensors or a single faulty sensor. While the introduction of noise typically degrades agent performance, we show that our techniques preserve performance relative to baselines under varying types and levels of noise across multiple environments within self-driving simulation domains (CARLA and Safety Gymnasium). Furthermore, we demonstrate that our methods enhance the stability of two state-of-the-art world models with markedly different underlying architectures: Cosmos and DreamerV3. Together, these results highlight the robustness of our approach across world modeling domains. We release our code at https://github.com/Bluefin-Tuna/WISER.
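To make the core idea concrete, below is a minimal sketch of surprise-based rejection sampling in Python. It assumes a world model that exposes a scalar surprise score (here, the negative log-likelihood of an observation under the model's one-step prediction); the class and function names are hypothetical illustrations, not the WISER codebase's actual API, and the toy Gaussian model stands in for a learned dynamics model such as DreamerV3 or Cosmos.

```python
import numpy as np

class GaussianWorldModel:
    """Toy stand-in for a learned world model: surprise is the negative
    log-likelihood of an observation under an isotropic unit-variance
    Gaussian centered on the model's predicted next observation.
    (Illustrative only; real world models compute surprise from their
    own learned dynamics.)"""

    def __init__(self, predicted_obs: np.ndarray):
        self.predicted_obs = predicted_obs

    def surprise(self, obs: np.ndarray) -> float:
        # Gaussian NLL up to an additive constant: higher = more anomalous.
        return 0.5 * float(np.sum((obs - self.predicted_obs) ** 2))

def single_representation_rejection(model, obs, last_good_obs, threshold):
    """Single faulty sensor: reject an observation whose surprise exceeds
    a threshold, falling back to the most recent accepted observation."""
    if model.surprise(obs) > threshold:
        return last_good_obs  # treat obs as noise/OOD and discard it
    return obs

def multi_representation_rejection(model, candidate_obs):
    """Multiple possibly-faulty sensors: keep whichever candidate
    observation the world model finds least surprising."""
    scores = [model.surprise(o) for o in candidate_obs]
    return candidate_obs[int(np.argmin(scores))]

if __name__ == "__main__":
    model = GaussianWorldModel(predicted_obs=np.zeros(4))
    clean = np.array([0.1, -0.2, 0.0, 0.1])
    noisy = clean + np.random.normal(0.0, 5.0, size=4)  # simulated faulty sensor

    # Multi-representation: the clean sensor wins on surprise.
    kept = multi_representation_rejection(model, [clean, noisy])

    # Single-representation: the noisy reading is rejected in favor of
    # the last accepted observation.
    safe = single_representation_rejection(model, noisy, clean, threshold=2.0)
```

In this sketch, the threshold plays the role of a tunable sensitivity knob: set too low, valid but unusual observations get discarded; set too high, noisy readings slip through to the policy.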
Similar Papers
World Models for Anomaly Detection during Model-Based Reinforcement Learning Inference
Robotics
Keeps robots safe by stopping them when they're unsure.
World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations
Robotics
Helps robots learn to drive better using less data.
AD-R1: Closed-Loop Reinforcement Learning for End-to-End Autonomous Driving with Impartial World Models
CV and Pattern Recognition
Teaches self-driving cars to avoid crashes.