One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework
By: Feiran Li , Qianqian Xu , Shilong Bao and more
Potential Business Impact:
Stops AI art from making bad pictures better.
Concept erasing has recently emerged as an effective paradigm to prevent text-to-image diffusion models from generating visually undesirable or even harmful content. However, current removal methods heavily rely on manually crafted text prompts, making it challenging to achieve a high erasure (efficacy) while minimizing the impact on other benign concepts (usability). In this paper, we attribute the limitations to the inherent gap between the text and image modalities, which makes it hard to transfer the intricately entangled concept knowledge from text prompts to the image generation process. To address this, we propose a novel solution by directly integrating visual supervision into the erasure process, introducing the first text-image Collaborative Concept Erasing (Co-Erasing) framework. Specifically, Co-Erasing describes the concept jointly by text prompts and the corresponding undesirable images induced by the prompts, and then reduces the generating probability of the target concept through negative guidance. This approach effectively bypasses the knowledge gap between text and image, significantly enhancing erasure efficacy. Additionally, we design a text-guided image concept refinement strategy that directs the model to focus on visual features most relevant to the specified text concept, minimizing disruption to other benign concepts. Finally, comprehensive experiments suggest that Co-Erasing outperforms state-of-the-art erasure approaches significantly with a better trade-off between efficacy and usability. Codes are available at https://github.com/Ferry-Li/Co-Erasing.
Similar Papers
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
CV and Pattern Recognition
Stops AI from making bad pictures.
Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts
CV and Pattern Recognition
Fixes AI art when it messes up other things.
Bi-Erasing: A Bidirectional Framework for Concept Removal in Diffusion Models
CV and Pattern Recognition
Removes bad pictures, keeps good ones.