Borrowing from anything: A generalizable framework for reference-guided instance editing
By: Shengxiao Zhou , Chenghua Li , Jianhao Huang and more
Potential Business Impact:
Changes pictures precisely without messing them up.
Reference-guided instance editing is fundamentally limited by semantic entanglement, where a reference's intrinsic appearance is intertwined with its extrinsic attributes. The key challenge lies in disentangling what information should be borrowed from the reference, and determining how to apply it appropriately to the target. To tackle this challenge, we propose GENIE, a Generalizable Instance Editing framework capable of achieving explicit disentanglement. GENIE first corrects spatial misalignments with a Spatial Alignment Module (SAM). Then, an Adaptive Residual Scaling Module (ARSM) learns what to borrow by amplifying salient intrinsic cues while suppressing extrinsic attributes, while a Progressive Attention Fusion (PAF) mechanism learns how to render this appearance onto the target, preserving its structure. Extensive experiments on the challenging AnyInsertion dataset demonstrate that GENIE achieves state-of-the-art fidelity and robustness, setting a new standard for disentanglement-based instance editing.
Similar Papers
Reversible Inversion for Training-Free Exemplar-guided Image Editing
CV and Pattern Recognition
Changes pictures using a guide picture.
Latent Expression Generation for Referring Image Segmentation and Grounding
CV and Pattern Recognition
Finds the right object even with tricky descriptions.
Latent Expression Generation for Referring Image Segmentation and Grounding
CV and Pattern Recognition
Finds the right object even with tricky descriptions.