Immunizing Images from Text-to-Image Editing via Adversarial Cross-Attention
By: Matteo Trippodo, Federico Becattini, Lorenzo Seidenari
Potential Business Impact:
Protects photos by fooling AI editing tools.
Recent advances in text-based image editing have enabled fine-grained manipulation of visual content guided by natural language. However, such methods are susceptible to adversarial attacks. In this work, we propose a novel attack that targets the visual component of editing methods. We introduce Attention Attack, which disrupts the cross-attention between a textual prompt and the visual representation of the image by using an automatically generated caption of the source image as a proxy for the edit prompt. This breaks the alignment between the contents of the image and their textual description, without requiring knowledge of the editing method or the editing prompt. Since existing metrics do not reliably capture immunization success, we also propose two novel evaluation strategies: Caption Similarity, which quantifies semantic consistency between the original and adversarial edits, and Semantic Intersection over Union (IoU), which measures spatial layout disruption via segmentation masks. Experiments on the TEDBench++ benchmark demonstrate that our attack significantly degrades editing performance while remaining imperceptible.
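The abstract does not pin down a concrete loss or model stack, so the following is a minimal PGD-style sketch of the idea, assuming access to a differentiable `attn_maps(image, tokens)` callable that exposes the editor's cross-attention maps (in practice, hooks on a diffusion editor's UNet). The loss, which pushes the maps for the image's own caption away from their clean values, is an illustrative choice rather than the paper's exact objective, and `eps`, `alpha`, and `steps` are hypothetical defaults.

```python
import torch

def attention_attack(image, caption_tokens, attn_maps,
                     eps=4 / 255, alpha=1 / 255, steps=40):
    """Craft an imperceptible perturbation that disrupts the cross-attention
    between an image and its own automatically generated caption."""
    # Reference: attention maps for the clean image and its caption.
    clean_maps = attn_maps(image, caption_tokens).detach()
    delta = torch.zeros_like(image, requires_grad=True)
    for _ in range(steps):
        # Illustrative objective: maximize the distance to the clean maps.
        loss = -torch.nn.functional.mse_loss(
            attn_maps(image + delta, caption_tokens), clean_maps)
        loss.backward()
        with torch.no_grad():
            delta -= alpha * delta.grad.sign()  # signed gradient step
            delta.clamp_(-eps, eps)             # keep the change imperceptible
            delta.grad.zero_()
    return (image + delta).clamp(0, 1).detach()

# Toy stand-in for a real editor's cross-attention, just to show the call shape;
# it ignores the tokens and is not a real editing model.
proj = torch.nn.Linear(3 * 64 * 64, 77)
toy_attn = lambda img, tok: torch.softmax(proj(img.flatten(1)), dim=-1)
immunized = attention_attack(torch.rand(1, 3, 64, 64), None, toy_attn)
```

The two proposed metrics can be sketched more concretely, since their inputs are plain images and segmentation masks. The captioner (BLIP), the sentence embedder (all-MiniLM-L6-v2), and the mean-IoU aggregation below are assumptions for illustration; the paper may use different models and a different aggregation.

```python
import numpy as np
from transformers import BlipProcessor, BlipForConditionalGeneration
from sentence_transformers import SentenceTransformer

_proc = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
_blip = BlipForConditionalGeneration.from_pretrained(
    "Salesforce/blip-image-captioning-base")
_embed = SentenceTransformer("all-MiniLM-L6-v2")

def caption(image):
    """Caption a PIL image with an off-the-shelf captioner."""
    inputs = _proc(image, return_tensors="pt")
    return _proc.decode(_blip.generate(**inputs)[0], skip_special_tokens=True)

def caption_similarity(edit_clean, edit_adv):
    """Cosine similarity between the captions of the clean and adversarial edits."""
    a, b = _embed.encode([caption(edit_clean), caption(edit_adv)],
                         normalize_embeddings=True)
    return float(a @ b)

def semantic_iou(mask_clean, mask_adv):
    """Mean IoU over the semantic classes present in either edit's mask."""
    classes = np.union1d(np.unique(mask_clean), np.unique(mask_adv))
    ious = []
    for c in classes:
        a, b = mask_clean == c, mask_adv == c
        union = np.logical_or(a, b).sum()
        if union:
            ious.append(np.logical_and(a, b).sum() / union)
    return float(np.mean(ious))
```

Under this reading, low Caption Similarity and low Semantic IoU between the clean and adversarial edits indicate successful immunization: the attacked image no longer edits coherently.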
Similar Papers
BiasMap: Leveraging Cross-Attentions to Discover and Mitigate Hidden Social Biases in Text-to-Image Generation
CV and Pattern Recognition
Finds and fixes unfairness in AI art.
IE-Critic-R1: Advancing the Explanatory Measurement of Text-Driven Image Editing for Human Perception Alignment
CV and Pattern Recognition
Helps computers judge edited pictures like people do.
A Generative Adversarial Approach to Adversarial Attacks Guided by Contrastive Language-Image Pre-trained Model
CV and Pattern Recognition
Fools AI with tiny, hidden changes.