Watermarking Autoregressive Image Generation
By: Nikola Jovanović , Ismail Labiad , Tomáš Souček and more
Potential Business Impact:
Marks AI-made pictures so we know they're fake.
Watermarking the outputs of generative models has emerged as a promising approach for tracking their provenance. Despite significant interest in autoregressive image generation models and their potential for misuse, no prior work has attempted to watermark their outputs at the token level. In this work, we present the first such approach by adapting language model watermarking techniques to this setting. We identify a key challenge: the lack of reverse cycle-consistency (RCC), wherein re-tokenizing generated image tokens significantly alters the token sequence, effectively erasing the watermark. To address this and to make our method robust to common image transformations, neural compression, and removal attacks, we introduce (i) a custom tokenizer-detokenizer finetuning procedure that improves RCC, and (ii) a complementary watermark synchronization layer. As our experiments demonstrate, our approach enables reliable and robust watermark detection with theoretically grounded p-values.
Similar Papers
A Watermark for Auto-Regressive Image Generation Models
CV and Pattern Recognition
Marks fake pictures so you know they're not real.
Towards Robust Red-Green Watermarking for Autoregressive Image Generators
CV and Pattern Recognition
Marks AI-made pictures so you know they're fake.
Autoregressive Images Watermarking through Lexical Biasing: An Approach Resistant to Regeneration Attack
Cryptography and Security
Marks AI-made pictures so they can't be faked.