Training-Free Color-Aware Adversarial Diffusion Sanitization for Diffusion Stegomalware Defense at Security Gateways
By: Vladimir Frants, Sos Agaian
Potential Business Impact:
Neutralizes secret payloads hidden in AI-generated images before they pass a security gateway.
The rapid expansion of generative AI has normalized large-scale synthetic media creation, enabling new forms of covert communication. Recent generative steganography methods, particularly those based on diffusion models, can embed high-capacity payloads without fine-tuning or auxiliary decoders, creating significant challenges for detection and remediation. Coverless diffusion-based techniques are difficult to counter because they generate image carriers directly from secret data, enabling attackers to deliver stegomalware for command-and-control, payload staging, and data exfiltration while bypassing detectors that rely on cover-stego discrepancies. This work introduces Adversarial Diffusion Sanitization (ADS), a training-free defense for security gateways that neutralizes hidden payloads rather than detecting them. ADS employs an off-the-shelf pretrained denoiser as a differentiable proxy for diffusion-based decoders and incorporates a color-aware, quaternion-coupled update rule to reduce artifacts under strict distortion limits. Under a practical threat model and in evaluation against the state-of-the-art diffusion steganography method Pulsar, ADS drives decoder success rates to near zero with minimal perceptual impact. Results demonstrate that ADS provides a favorable security-utility trade-off compared to standard content transformations, offering an effective mitigation strategy against diffusion-driven steganography.
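The general shape of a sanitizer that perturbs an image under a strict distortion limit, using gradients from a differentiable proxy, can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the authors' implementation: `box_blur` is a hypothetical stand-in for the pretrained denoiser, the objective (maximize the change in the proxy's output) is an assumed one, and the bound is a plain L-infinity budget `eps`. The color-aware, quaternion-coupled update rule described in the abstract is not reproduced here; each channel is updated independently.

```python
import numpy as np

def box_blur(x):
    """Hypothetical stand-in for a pretrained denoiser:
    a 3x3 mean filter with circular padding, applied per channel."""
    out = np.zeros_like(x)
    for dy in (-1, 0, 1):
        for dx in (-1, 0, 1):
            out += np.roll(x, shift=(dy, dx), axis=(0, 1))
    return out / 9.0

def sanitize(image, eps=2 / 255, step=0.5 / 255, iters=10, seed=0):
    """Projected signed-gradient ascent on 0.5 * ||box_blur(delta)||^2,
    keeping |delta| <= eps and the result inside [0, 1].
    image: float array of shape (H, W, C) with values in [0, 1]."""
    rng = np.random.default_rng(seed)
    delta = rng.uniform(-eps, eps, size=image.shape)  # random start inside the budget
    for _ in range(iters):
        # box_blur is linear and symmetric (circular padding), so the
        # gradient of 0.5 * ||box_blur(delta)||^2 is box_blur(box_blur(delta)).
        grad = box_blur(box_blur(delta))
        # Signed step, then projection back onto the L-infinity ball.
        delta = np.clip(delta + step * np.sign(grad), -eps, eps)
        # Keep the perturbed image a valid image.
        delta = np.clip(image + delta, 0.0, 1.0) - image
    return np.clip(image + delta, 0.0, 1.0)
```

In a real gateway the toy filter would be replaced by an actual pretrained denoiser with gradients computed by automatic differentiation. The point of the sketch is only the structure: a gradient step against a proxy, followed by projection onto the distortion budget.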