Score: 0

LAMS-Edit: Latent and Attention Mixing with Schedulers for Improved Content Preservation in Diffusion-Based Image and Style Editing

Published: January 6, 2026 | arXiv ID: 2601.02987v1

By: Wingwa Fu, Takayuki Okatani

Potential Business Impact:

Changes pictures accurately by blending ideas.

Business Areas:
Photo Editing Content and Publishing, Media and Entertainment

Text-to-Image editing using diffusion models faces challenges in balancing content preservation with edit application and handling real-image editing. To address these, we propose LAMS-Edit, leveraging intermediate states from the inversion process--an essential step in real-image editing--during edited image generation. Specifically, latent representations and attention maps from both processes are combined at each step using weighted interpolation, controlled by a scheduler. This technique, Latent and Attention Mixing with Schedulers (LAMS), integrates with Prompt-to-Prompt (P2P) to form LAMS-Edit--an extensible framework that supports precise editing with region masks and enables style transfer via LoRA. Extensive experiments demonstrate that LAMS-Edit effectively balances content preservation and edit application.

Country of Origin
šŸ‡ÆšŸ‡µ Japan

Page Count
15 pages

Category
Computer Science:
CV and Pattern Recognition