Score: 1

SIRR-LMM: Single-image Reflection Removal via Large Multimodal Model

Published: January 12, 2026 | arXiv ID: 2601.07209v1

By: Yu Guo , Zhiqiang Lao , Xiyun Song and more

Potential Business Impact:

Cleans up messy reflections from glass pictures.

Business Areas:
Image Recognition Data and Analytics, Software

Glass surfaces create complex interactions of reflected and transmitted light, making single-image reflection removal (SIRR) challenging. Existing datasets suffer from limited physical realism in synthetic data or insufficient scale in real captures. We introduce a synthetic dataset generation framework that path-traces 3D glass models over real background imagery to create physically accurate reflection scenarios with varied glass properties, camera settings, and post-processing effects. To leverage the capabilities of Large Multimodal Model (LMM), we concatenate the image layers into a single composite input, apply joint captioning, and fine-tune the model using task-specific LoRA rather than full-parameter training. This enables our approach to achieve improved reflection removal and separation performance compared to state-of-the-art methods.

Page Count
12 pages

Category
Computer Science:
CV and Pattern Recognition