Expanding the Content-Style Frontier: a Balanced Subspace Blending Approach for Content-Style LoRA Fusion
By: Linhao Huang
Potential Business Impact:
Makes AI art keep its meaning at any style.
Recent advancements in text-to-image diffusion models have significantly improved the personalization and stylization of generated images. However, previous studies have only assessed content similarity under a single style intensity. In our experiments, we observe that increasing style intensity leads to a significant loss of content features, resulting in a suboptimal content-style frontier. To address this, we propose a novel approach to expand the content-style frontier by leveraging Content-Style Subspace Blending and a Content-Style Balance loss. Our method improves content similarity across varying style intensities, significantly broadening the content-style frontier. Extensive experiments demonstrate that our approach outperforms existing techniques in both qualitative and quantitative evaluations, achieving superior content-style trade-off with significantly lower Inverted Generational Distance (IGD) and Generational Distance (GD) scores compared to current methods.
Similar Papers
SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer
CV and Pattern Recognition
Makes pictures look like famous paintings.
SplitFlux: Learning to Decouple Content and Style from a Single Image
CV and Pattern Recognition
Changes picture style without messing up the main subject.
LLM-Enabled Style and Content Regularization for Personalized Text-to-Image Generation
CV and Pattern Recognition
Makes AI pictures match your style better.