Score: 0

Beyond Weight Adaptation: Feature-Space Domain Injection for Cross-Modal Ship Re-Identification

Published: December 24, 2025 | arXiv ID: 2512.20892v1

By: Tingfeng Xian , Wenlve Zhou , Zhiheng Zhou and more

Cross-Modality Ship Re-Identification (CMS Re-ID) is critical for achieving all-day and all-weather maritime target tracking, yet it is fundamentally challenged by significant modality discrepancies. Mainstream solutions typically rely on explicit modality alignment strategies; however, this paradigm heavily depends on constructing large-scale paired datasets for pre-training. To address this, grounded in the Platonic Representation Hypothesis, we explore the potential of Vision Foundation Models (VFMs) in bridging modality gaps. Recognizing the suboptimal performance of existing generic Parameter-Efficient Fine-Tuning (PEFT) methods that operate within the weight space, particularly on limited-capacity models, we shift the optimization perspective to the feature space and propose a novel PEFT strategy termed Domain Representation Injection (DRI). Specifically, while keeping the VFM fully frozen to maximize the preservation of general knowledge, we design a lightweight, learnable Offset Encoder to extract domain-specific representations rich in modality and identity attributes from raw inputs. Guided by the contextual information of intermediate features at different layers, a Modulator adaptively transforms these representations. Subsequently, they are injected into the intermediate layers via additive fusion, dynamically reshaping the feature distribution to adapt to the downstream task without altering the VFM's pre-trained weights. Extensive experimental results demonstrate the superiority of our method, achieving State-of-the-Art (SOTA) performance with minimal trainable parameters. For instance, on the HOSS-ReID dataset, we attain 57.9\% and 60.5\% mAP using only 1.54M and 7.05M parameters, respectively. The code is available at https://github.com/TingfengXian/DRI.

Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification

CV and Pattern Recognition

Helps cameras find people in different light.

4 Dec 2025 1

88%

Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing

CV and Pattern Recognition

Keeps fake faces from fooling face scanners.

18 Nov 2025 1

88%

Domain-Shared Learning and Gradual Alignment for Unsupervised Domain Adaptation Visible-Infrared Person Re-Identification

CV and Pattern Recognition

Helps cameras find people in different light.

20 Nov 2025 0

View PDF Login to Bookmark

Beyond Weight Adaptation: Feature-Space Domain Injection for Cross-Modal Ship Re-Identification

Technical Abstract

Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification

Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing

Domain-Shared Learning and Gradual Alignment for Unsupervised Domain Adaptation Visible-Infrared Person Re-Identification