Score: 0

Contrast-Guided Cross-Modal Distillation for Thermal Object Detection

Published: November 3, 2025 | arXiv ID: 2511.01435v1

By: SiWoo Kim, JhongHyun An

Potential Business Impact:

Improves night vision cameras to see better.

Business Areas:

Image Recognition Data and Analytics, Software

Robust perception at night remains challenging for thermal-infrared detection: low contrast and weak high-frequency cues lead to duplicate, overlapping boxes, missed small objects, and class confusion. Prior remedies either translate TIR to RGB and hope pixel fidelity transfers to detection -- making performance fragile to color or structure artifacts -- or fuse RGB and TIR at test time, which requires extra sensors, precise calibration, and higher runtime cost. Both lines can help in favorable conditions, but do not directly shape the thermal representation used by the detector. We keep mono-modality inference and tackle the root causes during training. Specifically, we introduce training-only objectives that sharpen instance-level decision boundaries by pulling together features of the same class and pushing apart those of different classes -- suppressing duplicate and confusing detections -- and that inject cross-modal semantic priors by aligning the student's multi-level pyramid features with an RGB-trained teacher, thereby strengthening texture-poor thermal features without visible input at test time. In experiments, our method outperformed prior approaches and achieved state-of-the-art performance.

3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion

CV and Pattern Recognition

Makes phone cameras see heat better, clearer.

24 Nov 2025 2

88%

Dual-Granularity Semantic Prompting for Language Guidance Infrared Small Target Detection

CV and Pattern Recognition

Finds tiny things in dark pictures using words.

24 Nov 2025 1

88%

Fusion or Confusion? Assessing the impact of visible-thermal image fusion for automated wildlife detection

CV and Pattern Recognition

Helps find birds and nests from the sky.

27 Nov 2025 0

View PDF Login to Bookmark

Country of Origin

🇰🇷 Korea, Republic of

Page Count

6 pages

Contrast-Guided Cross-Modal Distillation for Thermal Object Detection

Improves night vision cameras to see better.

Technical Abstract

3M-TI: High-Quality Mobile Thermal Imaging via Calibration-free Multi-Camera Cross-Modal Diffusion

Dual-Granularity Semantic Prompting for Language Guidance Infrared Small Target Detection

Fusion or Confusion? Assessing the impact of visible-thermal image fusion for automated wildlife detection