SeedEdit 3.0: Fast and High-Quality Generative Image Editing
By: Peng Wang , Yichun Shi , Xiaochen Lian and more
Potential Business Impact:
Changes pictures exactly how you ask.
We introduce SeedEdit 3.0, in companion with our T2I model Seedream 3.0, which significantly improves over our previous SeedEdit versions in both aspects of edit instruction following and image content (e.g., ID/IP) preservation on real image inputs. Additional to model upgrading with T2I, in this report, we present several key improvements. First, we develop an enhanced data curation pipeline with a meta-info paradigm and meta-info embedding strategy that help mix images from multiple data sources. This allows us to scale editing data effectively, and meta information is helpfult to connect VLM with diffusion model more closely. Second, we introduce a joint learning pipeline for computing a diffusion loss and reward losses. Finally, we evaluate SeedEdit 3.0 on our testing benchmarks, for real/synthetic image editing, where it achieves a best trade-off between multiple aspects, yielding a high usability rate of 56.1%, compared to SeedEdit 1.6 (38.4%), GPT4o (37.1%) and Gemini 2.0 (30.3%).
Similar Papers
Seedream 4.0: Toward Next-generation Multimodal Image Generation
CV and Pattern Recognition
Makes pictures from words, edits them, and combines them.
Seedream 3.0 Technical Report
CV and Pattern Recognition
Creates better pictures from words, even Chinese.
Step1X-Edit: A Practical Framework for General Image Editing
CV and Pattern Recognition
Makes computer pictures change like magic.