SplitFlux: Learning to Decouple Content and Style from a Single Image
By: Yitong Yang, Yinglin Wang, Changshuo Wang and more
Potential Business Impact:
Changes picture style without messing up the main subject.
Disentangling image content and style is essential for customized image generation. Existing SDXL-based methods struggle to achieve high-quality results, while the recently proposed Flux model fails to achieve effective content-style separation because its characteristics remain underexplored. To address these challenges, we conduct a systematic analysis of Flux and make two key observations: (1) single stream blocks are essential for image generation; and (2) early single stream blocks mainly control content, whereas later blocks govern style. Based on these insights, we propose SplitFlux, which disentangles content and style by fine-tuning the single stream blocks via LoRA, enabling the disentangled content to be re-embedded into new contexts. It includes two key components: (1) Rank-Constrained Adaptation. To preserve content identity and structure, we compress the rank and amplify the magnitude of updates within specific blocks, preventing content leakage into style blocks. (2) Visual-Gated LoRA. We split the content LoRA into two branches with different ranks, guided by image saliency. The high-rank branch preserves primary subject information, while the low-rank branch encodes residual details, mitigating content overfitting and enabling seamless re-embedding. Extensive experiments demonstrate that SplitFlux consistently outperforms state-of-the-art methods, achieving superior content preservation and stylization quality across diverse scenarios.
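The two-branch idea described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not specify how saliency enters the computation, so the per-token soft gate, the `LoRABranch` class, the `visual_gated_lora` function, and the `scale` parameter (standing in for the "amplified magnitude" of a rank-compressed update) are all hypothetical names and assumptions made for illustration.

```python
import numpy as np


class LoRABranch:
    """Low-rank update: delta(x) = scale * (x @ A.T) @ B.T (hypothetical helper)."""

    def __init__(self, d_in, d_out, rank, scale, rng):
        # Standard LoRA-style init: A is small random, B is zero,
        # so the branch contributes nothing before training.
        self.A = rng.normal(0.0, 0.01, size=(rank, d_in))
        self.B = np.zeros((d_out, rank))
        # Assumption: a larger scale plays the role of "amplifying the
        # magnitude" of an update whose rank has been compressed.
        self.scale = scale

    def __call__(self, x):
        # x: (tokens, d_in) -> (tokens, d_out)
        return self.scale * (x @ self.A.T) @ self.B.T


def visual_gated_lora(x, W, high, low, saliency):
    """Frozen linear layer plus two saliency-gated LoRA branches.

    x:        (tokens, d_in) token features
    W:        (d_out, d_in) frozen base weight
    saliency: (tokens,) values in [0, 1]; 1 marks the primary subject
    Assumption: salient tokens are routed to the high-rank branch and
    the remaining tokens to the low-rank branch via a soft gate.
    """
    gate = saliency[:, None]
    base = x @ W.T
    return base + gate * high(x) + (1.0 - gate) * low(x)


# Usage with toy shapes (all sizes are arbitrary).
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
W = rng.normal(size=(6, 8))
high = LoRABranch(8, 6, rank=4, scale=2.0, rng=rng)  # primary subject
low = LoRABranch(8, 6, rank=1, scale=2.0, rng=rng)   # residual details
saliency = np.array([1.0, 1.0, 0.0, 0.0])
out = visual_gated_lora(x, W, high, low, saliency)
```

In this sketch the gate only decides which branch sees which tokens; restricting such adapters to early or late single stream blocks, as the abstract describes, would be a separate choice made when attaching them to the model.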
Similar Papers
SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
CV and Pattern Recognition
Lets computers change picture styles without losing meaning.
Inversion-Free Style Transfer with Dual Rectified Flows
CV and Pattern Recognition
Makes pictures look like art, super fast.
Expanding the Content-Style Frontier: a Balanced Subspace Blending Approach for Content-Style LoRA Fusion
CV and Pattern Recognition
Makes AI art keep its meaning at any style.