Bridging Fidelity-Reality with Controllable One-Step Diffusion for Image Super-Resolution
By: Hao Chen , Junyang Chen , Jinshan Pan and more
Potential Business Impact:
Makes blurry pictures sharp and clear.
Recent diffusion-based one-step methods have shown remarkable progress in the field of image super-resolution, yet they remain constrained by three critical limitations: (1) inferior fidelity performance caused by the information loss from compression encoding of low-quality (LQ) inputs; (2) insufficient region-discriminative activation of generative priors; (3) misalignment between text prompts and their corresponding semantic regions. To address these limitations, we propose CODSR, a controllable one-step diffusion network for image super-resolution. First, we propose an LQ-guided feature modulation module that leverages original uncompressed information from LQ inputs to provide high-fidelity conditioning for the diffusion process. We then develop a region-adaptive generative prior activation method to effectively enhance perceptual richness without sacrificing local structural fidelity. Finally, we employ a text-matching guidance strategy to fully harness the conditioning potential of text prompts. Extensive experiments demonstrate that CODSR achieves superior perceptual quality and competitive fidelity compared with state-of-the-art methods with efficient one-step inference.
Similar Papers
One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution
CV and Pattern Recognition
Makes blurry pictures sharp and clear, with control.
Realism Control One-step Diffusion for Real-World Image Super-Resolution
CV and Pattern Recognition
Makes blurry pictures sharp and clear.
CTSR: Controllable Fidelity-Realness Trade-off Distillation for Real-World Image Super Resolution
CV and Pattern Recognition
Makes blurry pictures sharp and clear.