Reflection Removal through Efficient Adaptation of Diffusion Transformers
By: Daniyar Zakarin, Thiemo Wandel, Anton Obukhov and more
Potential Business Impact:
Cleans up photos taken through glass by removing reflections.
We introduce a diffusion-transformer (DiT) framework for single-image reflection removal that leverages the generalization strengths of foundation diffusion models in the restoration setting. Rather than relying on task-specific architectures, we repurpose a pre-trained DiT-based foundation model by conditioning it on reflection-contaminated inputs and guiding it toward clean transmission layers. We systematically analyze existing reflection removal data sources for diversity, scalability, and photorealism. To address the shortage of suitable data, we construct a physically based rendering (PBR) pipeline in Blender, built around the Principled BSDF, to synthesize realistic glass materials and reflection effects. Efficient LoRA-based adaptation of the foundation model, combined with the proposed synthetic data, achieves state-of-the-art performance on in-domain and zero-shot benchmarks. These results demonstrate that pretrained diffusion transformers, when paired with physically grounded data synthesis and efficient adaptation, offer a scalable and high-fidelity solution for reflection removal. Project page: https://hf.co/spaces/huawei-bayerlab/windowseat-reflection-removal-web
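The abstract credits "efficient LoRA-based adaptation" without describing the mechanics, so the following is a minimal sketch of the general low-rank adaptation idea, not the authors' implementation. It assumes a single frozen linear layer in NumPy; the class name `LoRALinear`, the rank, the `alpha` scaling, and the initialization are illustrative choices that do not come from the paper.

```python
import numpy as np


class LoRALinear:
    """Frozen linear layer plus a low-rank update: W_eff = W + (alpha / rank) * B @ A.

    Hypothetical illustration of LoRA; all names and defaults are assumptions.
    """

    def __init__(self, weight, rank=4, alpha=8.0, seed=0):
        rng = np.random.default_rng(seed)
        out_dim, in_dim = weight.shape
        self.weight = weight                                  # pretrained weight, kept frozen
        self.A = rng.normal(0.0, 0.01, size=(rank, in_dim))   # trainable down-projection
        self.B = np.zeros((out_dim, rank))                    # trainable up-projection, zero at start
        self.scale = alpha / rank

    def __call__(self, x):
        # Base output plus the scaled low-rank correction.
        base = x @ self.weight.T
        update = (x @ self.A.T) @ self.B.T
        return base + self.scale * update

    def num_trainable(self):
        # Only A and B would be updated during adaptation.
        return self.A.size + self.B.size


# Small demo with made-up shapes.
demo_weight = np.random.default_rng(1).normal(size=(64, 32))
demo_layer = LoRALinear(demo_weight, rank=4)
demo_input = np.ones((2, 32))
demo_output = demo_layer(demo_input)
print(demo_output.shape, demo_layer.num_trainable(), demo_weight.size)
```

Because `B` starts at zero, the adapted layer initially reproduces the frozen layer exactly, and only the two small matrices would need gradients. In this toy setting that is 384 trainable values against 2048 frozen ones, which is the sense in which such adaptation is "efficient".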
Similar Papers
Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers
CV and Pattern Recognition
Helps computers understand pictures better for dense perception tasks.
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
CV and Pattern Recognition
Makes AI pictures better by letting it fix its own mistakes.
Harnessing Diffusion-Generated Synthetic Images for Fair Image Classification
CV and Pattern Recognition
Makes AI fairer by fixing biased training pictures.