Score: 0

When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models

Published: November 26, 2025 | arXiv ID: 2511.21192v1

By: Hui Lu , Yi Yu , Yiming Yang and more

Potential Business Impact:

Makes robots easily fooled by fake pictures.

Business Areas:

Image Recognition Data and Analytics, Software

Vision-Language-Action (VLA) models are vulnerable to adversarial attacks, yet universal and transferable attacks remain underexplored, as most existing patches overfit to a single model and fail in black-box settings. To address this gap, we present a systematic study of universal, transferable adversarial patches against VLA-driven robots under unknown architectures, finetuned variants, and sim-to-real shifts. We introduce UPA-RFAS (Universal Patch Attack via Robust Feature, Attention, and Semantics), a unified framework that learns a single physical patch in a shared feature space while promoting cross-model transfer. UPA-RFAS combines (i) a feature-space objective with an $\ell_1$ deviation prior and repulsive InfoNCE loss to induce transferable representation shifts, (ii) a robustness-augmented two-phase min-max procedure where an inner loop learns invisible sample-wise perturbations and an outer loop optimizes the universal patch against this hardened neighborhood, and (iii) two VLA-specific losses: Patch Attention Dominance to hijack text$\to$vision attention and Patch Semantic Misalignment to induce image-text mismatch without labels. Experiments across diverse VLA models, manipulation suites, and physical executions show that UPA-RFAS consistently transfers across models, tasks, and viewpoints, exposing a practical patch-based attack surface and establishing a strong baseline for future defenses.

Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models

CV and Pattern Recognition

Tricks robots into making wrong moves.

26 Nov 2025 0

92%

Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models

CV and Pattern Recognition

Makes robots follow bad instructions or ignore them.

15 Oct 2025 0

91%

When Alignment Fails: Multimodal Adversarial Attacks on Vision-Language-Action Models

CV and Pattern Recognition

Makes robots understand and obey commands better.

20 Nov 2025 0

View PDF Login to Bookmark

Page Count

11 pages

When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models

Makes robots easily fooled by fake pictures.

Technical Abstract

Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models

Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models

When Alignment Fails: Multimodal Adversarial Attacks on Vision-Language-Action Models