Dual-Domain Perspective on Degradation-Aware Fusion: A VLM-Guided Robust Infrared and Visible Image Fusion Framework

Published: September 5, 2025 | arXiv ID: 2509.05000v1

By: Tianpei Zhang, Jufeng Zhao, Yiming Zhu and more

Potential Business Impact:

Improves degraded night-vision pictures by fusing infrared and visible images of the same scene.

Business Areas:

Image Recognition Data and Analytics, Software

Most existing infrared-visible image fusion (IVIF) methods assume high-quality inputs, and therefore struggle to handle dual-source degraded scenarios, typically requiring manual selection and sequential application of multiple pre-enhancement steps. This decoupled pre-enhancement-to-fusion pipeline inevitably leads to error accumulation and performance degradation. To overcome these limitations, we propose Guided Dual-Domain Fusion (GD^2Fusion), a novel framework that synergistically integrates vision-language models (VLMs) for degradation perception with dual-domain (frequency/spatial) joint optimization. Concretely, the designed Guided Frequency Modality-Specific Extraction (GFMSE) module performs frequency-domain degradation perception and suppression and discriminatively extracts fusion-relevant sub-band features. Meanwhile, the Guided Spatial Modality-Aggregated Fusion (GSMAF) module carries out cross-modal degradation filtering and adaptive multi-source feature aggregation in the spatial domain to enhance modality complementarity and structural consistency. Extensive qualitative and quantitative experiments demonstrate that GD^2Fusion achieves superior fusion performance compared with existing algorithms and strategies in dual-source degraded scenarios. The code will be publicly released after acceptance of this paper.
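As a rough illustration of what splitting an image pair into frequency-like bands and fusing them can look like, here is a minimal sketch of classical two-scale fusion. It is not GD^2Fusion: there is no VLM guidance, no degradation suppression, and nothing is learned. The function names (`box_blur`, `split_bands`, `fuse`), the box-filter low-pass, and the average/max-absolute fusion rules are all assumptions chosen for the example, and images are plain nested lists of floats.

```python
def box_blur(img, radius=1):
    """Mean filter with edge clamping; a crude stand-in for a low-pass filter."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total, count = 0.0, 0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    yy = min(max(y + dy, 0), h - 1)  # clamp to image border
                    xx = min(max(x + dx, 0), w - 1)
                    total += img[yy][xx]
                    count += 1
            out[y][x] = total / count
    return out


def split_bands(img, radius=1):
    """Split an image into a low-frequency base and a high-frequency residual."""
    low = box_blur(img, radius)
    high = [[p - l for p, l in zip(row, low_row)]
            for row, low_row in zip(img, low)]
    return low, high


def fuse(ir, vis, radius=1):
    """Hypothetical fusion rule: average the bases, keep the stronger detail."""
    ir_low, ir_high = split_bands(ir, radius)
    vis_low, vis_high = split_bands(vis, radius)
    fused = []
    for y in range(len(ir)):
        row = []
        for x in range(len(ir[0])):
            base = 0.5 * (ir_low[y][x] + vis_low[y][x])
            a, b = ir_high[y][x], vis_high[y][x]
            detail = a if abs(a) >= abs(b) else b  # max-absolute selection
            row.append(base + detail)
        fused.append(row)
    return fused
```

With hand-crafted rules like these, any noise or blur in either input passes straight into the fused result, which is the kind of error accumulation the abstract attributes to pipelines that treat enhancement and fusion separately.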

Coupled Degradation Modeling and Fusion: A VLM-Guided Degradation-Coupled Network for Degradation-Aware Infrared and Visible Image Fusion

CV and Pattern Recognition

Fixes blurry pictures so you can see details.

13 Oct 2025 2

91%

MdaIF: Robust One-Stop Multi-Degradation-Aware Image Fusion with Language-Driven Semantics

CV and Pattern Recognition

Cleans up blurry pictures from bad weather.

16 Nov 2025 2

89%

Modality-Aware Infrared and Visible Image Fusion with Target-Aware Supervision

CV and Pattern Recognition

Makes cameras see better in fog and dark.

14 Sep 2025 0

Page Count

18 pages
