Realistic Environmental Injection Attacks on GUI Agents
By: Yitong Zhang, Ximo Li, Liyi Cai, and more
Potential Business Impact:
Lets malicious websites trick computer helpers into making mistakes.
GUI agents built on LVLMs are increasingly used to interact with websites. However, their exposure to open-world content makes them vulnerable to Environmental Injection Attacks (EIAs), which hijack agent behavior via webpage elements. Many recent studies assume the attacker is a regular user who can only upload a single trigger image, which is more realistic than earlier assumptions of website-level administrative control. However, these works still fall short of realism: (1) the trigger's position and surrounding context remain largely fixed between training and testing, failing to capture the dynamic nature of real webpages, and (2) the trigger often occupies an unrealistically large area, whereas real-world images are typically small. To better reflect real-world scenarios, we introduce a more realistic threat model in which the attacker is a regular user and the trigger image is small and embedded within a dynamically changing environment. Under this threat model, existing attacks prove largely ineffective. To better expose the vulnerabilities of GUI agents, we propose Chameleon, an attack framework with two main novelties. The first is LLM-Driven Environment Simulation, which automatically generates diverse and high-fidelity webpage simulations. The second is Attention Black Hole, which transforms attention weights into explicit supervisory signals that guide the agent's focus toward the trigger region. We evaluate Chameleon on 6 realistic websites and 4 representative LVLM-powered GUI agents, where it significantly outperforms existing methods. Ablation studies confirm that both novelties are critical to performance. Our findings reveal underexplored vulnerabilities in modern GUI agents and establish a robust foundation for future research on defenses for open-world GUI agent systems. The code is publicly available at https://github.com/zhangyitonggg/attack2gui.
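To make the first component concrete: LLM-Driven Environment Simulation can be read as prompting an LLM to emit many varied webpage contexts with the small trigger embedded at random positions and sizes, so the trigger is optimized against a moving environment rather than a single fixed page. The sketch below is a minimal illustration of that idea under stated assumptions; `llm_complete`, `PAGE_THEMES`, and the size range are hypothetical stand-ins, not the authors' implementation.

```python
# Hypothetical sketch of LLM-driven environment simulation.
# `llm_complete` is assumed to be any callable that maps a text prompt
# to generated HTML (e.g., a thin wrapper around an LLM API).
import random

PAGE_THEMES = ["e-commerce product grid", "news article", "travel booking form"]

def simulate_page(trigger_img_url: str, llm_complete) -> str:
    """Generate one randomized webpage context that embeds the trigger."""
    theme = random.choice(PAGE_THEMES)
    width = random.randint(48, 96)  # keep the trigger realistically small
    prompt = (
        f"Generate a self-contained HTML page styled as a {theme}. "
        f"Somewhere natural in the layout, embed this image at roughly "
        f"{width}px wide: <img src='{trigger_img_url}' width='{width}'>. "
        "Vary the layout, surrounding text, and the image's position."
    )
    return llm_complete(prompt)
```

Training against a stream of such simulated pages is what would make the trigger robust to the position and context shifts that the threat model introduces.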
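Similarly, the Attention Black Hole idea of turning attention weights into an explicit supervisory signal can be sketched as a loss that rewards attention mass falling on the trigger's image patches. The PyTorch fragment below is one assumption-laden reading of the abstract, not the paper's actual objective; the tensor shapes and names (`attn_maps`, `trigger_mask`) are illustrative.

```python
# Hypothetical sketch of an "attention black hole"-style loss, assuming
# access to the LVLM's attention weights over image patches.
import torch

def attention_black_hole_loss(attn_maps: torch.Tensor,
                              trigger_mask: torch.Tensor) -> torch.Tensor:
    """Encourage the agent's attention to collapse onto the trigger region.

    attn_maps:    (batch, heads, num_patches) attention from the agent's
                  decision tokens to image patches.
    trigger_mask: (batch, num_patches) binary mask, 1 for patches covered
                  by the trigger image.
    """
    attn = attn_maps.mean(dim=1)                     # (batch, num_patches)
    mass_on_trigger = (attn * trigger_mask).sum(-1)  # (batch,)
    # Maximizing attention mass on the trigger == minimizing its negative log.
    return -mass_on_trigger.clamp_min(1e-8).log().mean()
```

In a full attack, such a term would presumably be combined with a behavior-hijacking objective and minimized with respect to the trigger pixels (e.g., via projected gradient steps), so that the agent both looks at the trigger and follows its injected instruction.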
Similar Papers
GhostEI-Bench: Are Mobile Agents Resilient to Environmental Injection in Dynamic On-Device Environments?
Cryptography and Security
Protects phones from fake pop-ups tricking apps.
The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections
Human-Computer Interaction
Protects your private info from sneaky computer programs.
Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree
Cryptography and Security
Hackers can trick web robots into doing bad things.