T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models
By: Xiaoyu Ye, Songjie Cheng, Yongtao Wang, and more
Potential Business Impact:
Stops video-generating AI from producing explicit or harmful clips.
Recent advances in text-to-video (T2V) diffusion models have significantly enhanced the quality of generated videos. However, their ability to produce explicit or harmful content raises concerns about misuse and potential rights violations. Inspired by the success of unlearning techniques in erasing undesirable concepts from text-to-image (T2I) models, we extend unlearning to T2V models and propose a robust and precise unlearning method. Specifically, we adopt negatively-guided velocity prediction fine-tuning and enhance it with prompt augmentation to ensure robustness against LLM-refined prompts. To achieve precise unlearning, we incorporate localization and preservation regularizations that preserve the model's ability to generate non-target concepts. Extensive experiments demonstrate that our method effectively erases a specific concept while preserving the model's generation capability for all other concepts, outperforming existing methods. We provide the unlearned models at https://github.com/VDIGPKU/T2VUnlearning.git.
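To make the core idea concrete, here is a minimal PyTorch-style sketch of what negatively-guided velocity prediction fine-tuning with a preservation term could look like. This is an illustration based only on the abstract, not the authors' implementation; the names (student, frozen, erase_emb, null_emb, other_emb, eta) and the exact loss form are assumptions.

import torch
import torch.nn.functional as F

def unlearning_loss(student, frozen, x_t, t, erase_emb, null_emb, eta=1.0):
    # Build a negatively-guided velocity target from a frozen copy of the
    # model: start from the unconditional velocity and steer away from the
    # erased concept's direction (assumed form, in the spirit of T2I
    # erasure methods such as ESD).
    with torch.no_grad():
        v_cond = frozen(x_t, t, erase_emb)   # frozen velocity on the concept
        v_null = frozen(x_t, t, null_emb)    # frozen unconditional velocity
        v_target = v_null - eta * (v_cond - v_null)
    # Fine-tune the student to match the negated target for the concept.
    return F.mse_loss(student(x_t, t, erase_emb), v_target)

def preservation_loss(student, frozen, x_t, t, other_emb):
    # Keep the student's velocity close to the frozen model's on
    # non-target prompts, so unrelated concepts are preserved.
    with torch.no_grad():
        v_keep = frozen(x_t, t, other_emb)
    return F.mse_loss(student(x_t, t, other_emb), v_keep)

In practice the two terms would be combined with a weighting factor, and prompt augmentation would supply LLM-rephrased variants of the erasure prompt as additional erase_emb inputs.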
Similar Papers
VideoEraser: Concept Erasure in Text-to-Video Diffusion Models
CV and Pattern Recognition
Stops video-generating AI from producing explicit or harmful clips.
Erasing Concepts, Steering Generations: A Comprehensive Survey of Concept Suppression
CV and Pattern Recognition
Stops image-generating AI from producing harmful or copyrighted pictures.