HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
By: Jiazi Bu, Pengyang Ling, Yujie Zhou and more
Potential Business Impact:
Makes computer pictures sharper and more detailed.
Text-to-image (T2I) diffusion/flow models have drawn considerable attention recently due to their remarkable ability to deliver flexible visual creations. Still, high-resolution image synthesis presents formidable challenges due to the scarcity and complexity of high-resolution content. Recent approaches have investigated training-free strategies to enable high-resolution image synthesis with pre-trained models. However, these techniques often struggle with generating high-quality visuals and tend to exhibit artifacts or low-fidelity details, as they typically rely solely on the endpoint of the low-resolution sampling trajectory while neglecting intermediate states that are critical for preserving structure and synthesizing finer detail. To this end, we present HiFlow, a training-free and model-agnostic framework to unlock the resolution potential of pre-trained flow models. Specifically, HiFlow establishes a virtual reference flow within the high-resolution space that effectively captures the characteristics of low-resolution flow information, offering guidance for high-resolution generation through three key aspects: initialization alignment for low-frequency consistency, direction alignment for structure preservation, and acceleration alignment for detail fidelity. By leveraging such flow-aligned guidance, HiFlow substantially elevates the quality of high-resolution image synthesis of T2I models and demonstrates versatility across their personalized variants. Extensive experiments validate HiFlow's capability in achieving superior high-resolution image quality over state-of-the-art methods.
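The abstract does not say how initialization alignment is computed. As one possible reading of "low-frequency consistency", the sketch below swaps the low-frequency band of a high-resolution array for that of an upsampled low-resolution reference, using a hard Fourier-domain mask. This is an illustrative assumption, not the authors' implementation: the function names, the nearest-neighbour upsampling, and the `cutoff` value are all hypothetical.

```python
import numpy as np


def low_pass_mask(h, w, cutoff):
    """Boolean mask (unshifted FFT layout) selecting frequencies whose
    radius, in cycles per sample, is at most `cutoff`."""
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    return np.sqrt(fy ** 2 + fx ** 2) <= cutoff


def upsample_nearest(x, factor):
    """Nearest-neighbour upsampling of a 2D array by an integer factor."""
    return np.repeat(np.repeat(x, factor, axis=0), factor, axis=1)


def align_low_frequencies(target, reference, cutoff=0.1):
    """Return `target` with its low-frequency band replaced by that of
    `reference`; frequencies above `cutoff` are left untouched.

    Hypothetical helper: the hard mask and default cutoff are assumptions.
    """
    if target.shape != reference.shape:
        raise ValueError("target and reference must have the same shape")
    mask = low_pass_mask(target.shape[0], target.shape[1], cutoff)
    mixed = np.where(mask, np.fft.fft2(reference), np.fft.fft2(target))
    # The mask is symmetric under frequency negation, so the mixed spectrum
    # of two real inputs is still Hermitian and the inverse is real up to
    # floating-point error.
    return np.fft.ifft2(mixed).real


rng = np.random.default_rng(0)
low_res = rng.standard_normal((8, 8))          # stand-in for a low-resolution result
reference = upsample_nearest(low_res, 4)       # 32 x 32 reference
high_res_init = rng.standard_normal((32, 32))  # stand-in for high-resolution noise
aligned = align_low_frequencies(high_res_init, reference, cutoff=0.1)
```

After this step, `aligned` shares its coarse layout (including its mean) with `reference` while keeping the high-frequency content of `high_res_init`. A real latent would have a channel dimension and the band split might be soft rather than hard; both are left out here for brevity.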
Similar Papers
FlowSteer: Conditioning Flow Field for Consistent Image Restoration
Image and Video Processing
Fixes blurry pictures by using smart guessing.
A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model
CV and Pattern Recognition
Makes AI pictures match the style you want.
TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models
CV and Pattern Recognition
Makes pictures from text and other pictures.