CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models
By: Yuyang Xue , Edward Moroshko , Feng Chen and more
Potential Business Impact:
Removes unwanted images without deleting similar ones.
Text-to-Image diffusion models can produce undesirable content that necessitates concept erasure. However, existing methods struggle with under-erasure, leaving residual traces of targeted concepts, or over-erasure, mistakenly eliminating unrelated but visually similar concepts. To address these limitations, we introduce CRCE, a novel concept erasure framework that leverages Large Language Models to identify both semantically related concepts that should be erased alongside the target and distinct concepts that should be preserved. By explicitly modelling coreferential and retained concepts semantically, CRCE enables more precise concept removal, without unintended erasure. Experiments demonstrate that CRCE outperforms existing methods on diverse erasure tasks, including real-world object, person identities, and abstract intellectual property characteristics. The constructed dataset CorefConcept and the source code will be release upon acceptance.
Similar Papers
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models
CV and Pattern Recognition
Stops AI from making bad pictures.
GrOCE:Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models
CV and Pattern Recognition
Removes bad ideas from AI art without ruining good ones.
ACE: Attentional Concept Erasure in Diffusion Models
CV and Pattern Recognition
Removes bad pictures from AI art generators.