Score: 0

Rethinking Infrared Small Target Detection: A Foundation-Driven Efficient Paradigm

Published: December 5, 2025 | arXiv ID: 2512.05511v1

By: Chuang Yu , Jinmiao Zhao , Yunpeng Liu and more

While large-scale visual foundation models (VFMs) exhibit strong generalization across diverse visual domains, their potential for single-frame infrared small target (SIRST) detection remains largely unexplored. To fill this gap, we systematically introduce the frozen representations from VFMs into the SIRST task for the first time and propose a Foundation-Driven Efficient Paradigm (FDEP), which can seamlessly adapt to existing encoder-decoder-based methods and significantly improve accuracy without additional inference overhead. Specifically, a Semantic Alignment Modulation Fusion (SAMF) module is designed to achieve dynamic alignment and deep fusion of the global semantic priors from VFMs with task-specific features. Meanwhile, to avoid the inference time burden introduced by VFMs, we propose a Collaborative Optimization-based Implicit Self-Distillation (CO-ISD) strategy, which enables implicit semantic transfer between the main and lightweight branches through parameter sharing and synchronized backpropagation. In addition, to unify the fragmented evaluation system, we construct a Holistic SIRST Evaluation (HSE) metric that performs multi-threshold integral evaluation at both pixel-level confidence and target-level robustness, providing a stable and comprehensive basis for fair model comparison. Extensive experiments demonstrate that the SIRST detection networks equipped with our FDEP framework achieve state-of-the-art (SOTA) performance on multiple public datasets. Our code is available at https://github.com/YuChuang1205/FDEP-Framework

IrisNet: Infrared Image Status Awareness Meta Decoder for Infrared Small Targets Detection

CV and Pattern Recognition

Helps cameras find tiny, faint things in pictures.

25 Nov 2025 1

90%

NS-FPN: Improving Infrared Small Target Detection and Segmentation from Noise Suppression Perspective

CV and Pattern Recognition

Finds tiny things in blurry pictures better.

9 Aug 2025 0

90%

Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes

CV and Pattern Recognition

Helps computers find tiny things in heat pictures.

10 Mar 2025 1

View PDF Login to Bookmark

Rethinking Infrared Small Target Detection: A Foundation-Driven Efficient Paradigm

Technical Abstract

IrisNet: Infrared Image Status Awareness Meta Decoder for Infrared Small Targets Detection

NS-FPN: Improving Infrared Small Target Detection and Segmentation from Noise Suppression Perspective

Text-IRSTD: Leveraging Semantic Text to Promote Infrared Small Target Detection in Complex Scenes