Structure-guided Diffusion Transformer for Low-Light Image Enhancement
By: Xiangchen Yin , Zhenda Yu , Longtao Jiang and more
Potential Business Impact:
Makes dark pictures clear without extra fuzz.
While the diffusion transformer (DiT) has become a focal point of interest in recent years, its application in low-light image enhancement remains a blank area for exploration. Current methods recover the details from low-light images while inevitably amplifying the noise in images, resulting in poor visual quality. In this paper, we firstly introduce DiT into the low-light enhancement task and design a novel Structure-guided Diffusion Transformer based Low-light image enhancement (SDTL) framework. We compress the feature through wavelet transform to improve the inference efficiency of the model and capture the multi-directional frequency band. Then we propose a Structure Enhancement Module (SEM) that uses structural prior to enhance the texture and leverages an adaptive fusion strategy to achieve more accurate enhancement effect. In Addition, we propose a Structure-guided Attention Block (SAB) to pay more attention to texture-riched tokens and avoid interference from noisy areas in noise prediction. Extensive qualitative and quantitative experiments demonstrate that our method achieves SOTA performance on several popular datasets, validating the effectiveness of SDTL in improving image quality and the potential of DiT in low-light enhancement tasks.
Similar Papers
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
CV and Pattern Recognition
Makes AI create better pictures faster.
D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
CV and Pattern Recognition
Makes computer pictures look more real and detailed.
Enhancing Image Generation Fidelity via Progressive Prompts
CV and Pattern Recognition
Makes AI draw pictures exactly where you want.