On the Information-Theoretic Fragility of Robust Watermarking under Diffusion Editing
By: Yunyi Ni , Ziyu Yang , Ze Niu and more
Potential Business Impact:
Breaks hidden codes in pictures using AI.
Robust invisible watermarking embeds hidden information in images such that the watermark can survive various manipulations. However, the emergence of powerful diffusion-based image generation and editing techniques poses a new threat to these watermarking schemes. In this paper, we investigate the intersection of diffusion-based image editing and robust image watermarking. We analyze how diffusion-driven image edits can significantly degrade or even fully remove embedded watermarks from state-of-the-art robust watermarking systems. Both theoretical formulations and empirical experiments are provided. We prove that as a image undergoes iterative diffusion transformations, the mutual information between the watermarked image and the embedded payload approaches zero, causing watermark decoding to fail. We further propose a guided diffusion attack algorithm that explicitly targets and erases watermark signals during generation. We evaluate our approach on recent deep learning-based watermarking schemes and demonstrate near-zero watermark recovery rates after attack, while maintaining high visual fidelity of the regenerated images. Finally, we discuss ethical implications of such watermark removal capablities and provide design guidelines for future watermarking strategies to be more resilient in the era of generative AI.
Similar Papers
Diffusion-Based Image Editing: An Unforeseen Adversary to Robust Invisible Watermarks
Cryptography and Security
Makes hidden messages in pictures disappear.
Diffusion-Based Image Editing for Breaking Robust Watermarks
CV and Pattern Recognition
Breaks hidden messages in pictures using AI.
Visual Watermarking in the Era of Diffusion Models: Advances and Challenges
CV and Pattern Recognition
Protects pictures from being copied without permission.