SIRR-LMM: Single-image Reflection Removal via Large Multimodal Model
By: Yu Guo , Zhiqiang Lao , Xiyun Song and more
Potential Business Impact:
Cleans up messy reflections from glass pictures.
Glass surfaces create complex interactions of reflected and transmitted light, making single-image reflection removal (SIRR) challenging. Existing datasets suffer from limited physical realism in synthetic data or insufficient scale in real captures. We introduce a synthetic dataset generation framework that path-traces 3D glass models over real background imagery to create physically accurate reflection scenarios with varied glass properties, camera settings, and post-processing effects. To leverage the capabilities of Large Multimodal Model (LMM), we concatenate the image layers into a single composite input, apply joint captioning, and fine-tune the model using task-specific LoRA rather than full-parameter training. This enables our approach to achieve improved reflection removal and separation performance compared to state-of-the-art methods.
Similar Papers
OpenRR-5k: A Large-Scale Benchmark for Reflection Removal in the Wild
CV and Pattern Recognition
Cleans up blurry reflections in pictures.
Reflection Removal through Efficient Adaptation of Diffusion Transformers
CV and Pattern Recognition
Cleans up blurry photos by removing reflections.
SMIR: Efficient Synthetic Data Pipeline To Improve Multi-Image Reasoning
CV and Pattern Recognition
Teaches computers to understand many pictures together.