Score: 1

State Backdoor: Towards Stealthy Real-world Poisoning Attack on Vision-Language-Action Model in State Space

Published: January 7, 2026 | arXiv ID: 2601.04266v1

By: Ji Guo, Wenbo Jiang, Yansong Lin, and more

Potential Business Impact:

Hides attack triggers in a robot arm's starting position, causing targeted misbehavior.

Business Areas:
Autonomous Vehicles, Transportation

Vision-Language-Action (VLA) models are widely deployed in safety-critical embodied AI applications such as robotics. However, their complex multimodal interactions also expose new security vulnerabilities. In this paper, we investigate a backdoor threat in VLA models, where malicious inputs cause targeted misbehavior while preserving performance on clean data. Existing backdoor methods predominantly rely on inserting visible triggers into the visual modality; these suffer from poor robustness and low stealthiness in real-world settings due to environmental variability. To overcome these limitations, we introduce the State Backdoor, a novel and practical backdoor attack that uses the robot arm's initial state as the trigger. To optimize the trigger for both stealth and effectiveness, we design a Preference-guided Genetic Algorithm (PGA) that efficiently searches the state space for minimal yet potent triggers. Extensive experiments on five representative VLA models and five real-world tasks show that our method achieves an attack success rate above 90% without degrading benign task performance, revealing an underexplored vulnerability in embodied AI systems.
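To make the idea concrete, below is a minimal, hypothetical sketch of a preference-guided genetic search over a robot arm's initial joint state, in the spirit of the PGA described in the abstract. The trigger representation (per-joint angle offsets), the perturbation bound, the stealth/effectiveness trade-off weight, and the toy `attack_score` fitness are all illustrative assumptions; the paper's actual algorithm would evaluate candidate triggers by querying the backdoored VLA model.

```python
import random

# Hypothetical sketch of a preference-guided genetic algorithm (PGA) that
# searches for a stealthy initial-state trigger. All names, bounds, and the
# fitness model are illustrative assumptions, not the authors' implementation.

NUM_JOINTS = 6
MAX_DELTA = 0.15       # assumed per-joint perturbation bound (radians)
POP_SIZE = 32
GENERATIONS = 50
STEALTH_WEIGHT = 0.5   # preference knob: trade stealth against attack strength

def attack_score(trigger):
    """Placeholder for querying the victim VLA model: would return the rate
    at which episodes started from `nominal_state + trigger` produce the
    targeted misbehavior. Faked here so the sketch runs standalone."""
    return 1.0 - sum(abs(d - 0.1) for d in trigger) / len(trigger)

def fitness(trigger):
    # Preference-guided objective: reward attack success while penalizing
    # large (easily noticed) deviations from the nominal initial state.
    stealth_penalty = sum(abs(d) for d in trigger) / (NUM_JOINTS * MAX_DELTA)
    return attack_score(trigger) - STEALTH_WEIGHT * stealth_penalty

def random_trigger():
    return [random.uniform(-MAX_DELTA, MAX_DELTA) for _ in range(NUM_JOINTS)]

def crossover(a, b):
    # Uniform crossover: each joint offset inherited from either parent.
    return [random.choice(pair) for pair in zip(a, b)]

def mutate(trigger, rate=0.2, scale=0.02):
    # Gaussian mutation, clipped to the assumed perturbation bound.
    return [min(MAX_DELTA, max(-MAX_DELTA, d + random.gauss(0, scale)))
            if random.random() < rate else d
            for d in trigger]

population = [random_trigger() for _ in range(POP_SIZE)]
for _ in range(GENERATIONS):
    population.sort(key=fitness, reverse=True)
    elite = population[: POP_SIZE // 4]          # keep the top quarter
    children = [mutate(crossover(random.choice(elite), random.choice(elite)))
                for _ in range(POP_SIZE - len(elite))]
    population = elite + children

best = max(population, key=fitness)
print("best trigger (joint offsets, rad):", [round(d, 3) for d in best])
```

The stealth penalty is what makes the search "preference-guided" in this sketch: raising `STEALTH_WEIGHT` drives the algorithm toward smaller, less conspicuous deviations from the nominal pose at the cost of attack strength, matching the paper's stated goal of minimal yet potent triggers.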

Country of Origin
🇨🇳 🇸🇬 China, Singapore

Page Count
14 pages

Category
Computer Science:
Cryptography and Security