Creative Image Generation with Diffusion Model
By: Kunpeng Song, Ahmed Elgammal
Potential Business Impact:
Makes computers create brand new, surprising pictures.
Creative image generation has emerged as a compelling area of research, driven by the need to produce novel and high-quality images that expand the boundaries of imagination. In this work, we propose a novel framework for creative generation using diffusion models, where creativity is associated with the inverse probability of an image's existence in the CLIP embedding space. Unlike prior approaches that rely on a manual blending of concepts or exclusion of subcategories, our method calculates the probability distribution of generated images and drives it towards low-probability regions to produce rare, imaginative, and visually captivating outputs. We also introduce pullback mechanisms, achieving high creativity without sacrificing visual fidelity. Extensive experiments on text-to-image diffusion models demonstrate the effectiveness and efficiency of our creative generation framework, showcasing its ability to produce unique, novel, and thought-provoking images. This work provides a new perspective on creativity in generative models, offering a principled method to foster innovation in visual content synthesis.
Similar Papers
Generative Image Coding with Diffusion Prior
CV and Pattern Recognition
Makes pictures look good even when squeezed small.
Reversible Efficient Diffusion for Image Fusion
CV and Pattern Recognition
Makes fused pictures clearer by fixing noise.
Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment
CV and Pattern Recognition
Makes AI better at growing crops with fake pictures.