Score: 1

FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing

Published: December 12, 2025 | arXiv ID: 2512.11395v1

By: Yilei Jiang , Zhen Wang , Yanghao Wang and more

Potential Business Impact:

Edits pictures with many changes at once.

Business Areas:
Photo Editing Content and Publishing, Media and Entertainment

With the surge of pre-trained text-to-image flow matching models, text-based image editing performance has gained remarkable improvement, especially for \underline{simple editing} that only contains a single editing target. To satisfy the exploding editing requirements, the \underline{complex editing} which contains multiple editing targets has posed as a more challenging task. However, current complex editing solutions: single-round and multi-round editing are limited by long text following and cumulative inconsistency, respectively. Thus, they struggle to strike a balance between semantic alignment and source consistency. In this paper, we propose \textbf{FlowDC}, which decouples the complex editing into multiple sub-editing effects and superposes them in parallel during the editing process. Meanwhile, we observed that the velocity quantity that is orthogonal to the editing displacement harms the source structure preserving. Thus, we decompose the velocity and decay the orthogonal part for better source consistency. To evaluate the effectiveness of complex editing settings, we construct a complex editing benchmark: Complex-PIE-Bench. On two benchmarks, FlowDC shows superior results compared with existing methods. We also detail the ablations of our module designs.

Country of Origin
🇭🇰 🇨🇳 China, Hong Kong

Page Count
19 pages

Category
Computer Science:
CV and Pattern Recognition