Dual Prompting Image Restoration with Diffusion Transformers
By: Dehong Kong, Fan Li, Zhixin Wang and more
Potential Business Impact:
Fixes blurry pictures using text and images.
Recent state-of-the-art image restoration methods mostly adopt latent diffusion models with U-Net backbones, yet still face challenges in achieving high-quality restoration due to their limited capabilities. Diffusion transformers (DiTs), like SD3, are emerging as a promising alternative because of their better quality with scalability. In this paper, we introduce DPIR (Dual Prompting Image Restoration), a novel image restoration method that effectively extracts conditional information of low-quality images from multiple perspectives. Specifically, DPIR consists of two branches: a low-quality image conditioning branch and a dual prompting control branch. The first branch utilizes a lightweight module to incorporate image priors into the DiT with high efficiency. More importantly, we believe that in image restoration, a textual description alone cannot fully capture an image's rich visual characteristics. Therefore, a dual prompting module is designed to provide the DiT with additional visual cues, capturing both global context and local appearance. The extracted global-local visual prompts serve as extra conditional control and, together with textual prompts, form dual prompts that greatly enhance the quality of the restoration. Extensive experimental results demonstrate that DPIR delivers superior image restoration performance.
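To make the idea of dual prompts concrete, the following is a minimal sketch of how textual tokens might be combined with global and local visual tokens into a single conditioning sequence. It is not the DPIR implementation: the abstract does not specify how the visual prompts are extracted or fused, so the mean-pooled global token, the use of raw patch features as local tokens, the plain concatenation, and the names `global_prompt` and `build_dual_prompt` are all assumptions made for illustration.

```python
from typing import List

Vector = List[float]


def global_prompt(patch_features: List[Vector]) -> Vector:
    """Average all patch features into one vector (assumed stand-in for global context)."""
    n = len(patch_features)
    dim = len(patch_features[0])
    return [sum(p[i] for p in patch_features) / n for i in range(dim)]


def build_dual_prompt(text_tokens: List[Vector],
                      patch_features: List[Vector]) -> List[Vector]:
    """Concatenate text tokens, one global visual token, and the local patch tokens.

    Assumes text and visual features already share the same embedding dimension.
    """
    if not patch_features:
        raise ValueError("need at least one patch feature")
    visual = [global_prompt(patch_features)] + [list(p) for p in patch_features]
    return [list(t) for t in text_tokens] + visual


# Toy inputs: two text tokens and two patch features, all 2-dimensional.
text_tokens = [[1.0, 0.0], [0.0, 1.0]]
patch_features = [[2.0, 4.0], [4.0, 8.0]]
dual = build_dual_prompt(text_tokens, patch_features)
```

In this toy setup the resulting sequence holds the two text tokens first, then one global token, then the two local tokens. A real model would presumably use learned encoders and projections in place of the averaging and concatenation used here.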
Similar Papers
Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration
CV and Pattern Recognition
Fixes blurry text in pictures, making it readable.
MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration
CV and Pattern Recognition
Cleans up blurry pictures using words and images.
UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior
Image and Video Processing
Cleans up blurry pictures for seeing and tasks.