LAMS-Edit: Latent and Attention Mixing with Schedulers for Improved Content Preservation in Diffusion-Based Image and Style Editing
By: Wingwa Fu, Takayuki Okatani
Potential Business Impact:
Changes pictures accurately by blending ideas.
Text-to-Image editing using diffusion models faces challenges in balancing content preservation with edit application and handling real-image editing. To address these, we propose LAMS-Edit, leveraging intermediate states from the inversion process--an essential step in real-image editing--during edited image generation. Specifically, latent representations and attention maps from both processes are combined at each step using weighted interpolation, controlled by a scheduler. This technique, Latent and Attention Mixing with Schedulers (LAMS), integrates with Prompt-to-Prompt (P2P) to form LAMS-Edit--an extensible framework that supports precise editing with region masks and enables style transfer via LoRA. Extensive experiments demonstrate that LAMS-Edit effectively balances content preservation and edit application.
Similar Papers
LatentEdit: Adaptive Latent Control for Consistent Semantic Editing
Graphics
Changes pictures while keeping the background the same.
PixPerfect: Seamless Latent Diffusion Local Editing with Discriminative Pixel-Space Refinement
CV and Pattern Recognition
Fixes weird spots in edited pictures.
Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion
CV and Pattern Recognition
Fixes AI art mistakes while it's being made.