Referring Camouflaged Object Detection With Multi-Context Overlapped Windows Cross-Attention
By: Yu Wen, Shuyong Gao, Shuping Zhang and more
Potential Business Impact:
Finds hidden things using pictures and words.
Referring camouflaged object detection (Ref-COD) aims to identify hidden objects by incorporating reference information such as images and text descriptions. Previous research has transformed reference images containing salient objects into one-dimensional prompts, yielding significant results. We explore ways to enhance performance through multi-context fusion of rich salient image features and camouflaged object features. To this end, we propose RFMNet, which takes features from multiple encoding stages of the reference salient images and fuses them interactively with the camouflage features at the corresponding encoding stages. Because the features of salient object images contain abundant object-related detail, fusing features within local areas is more beneficial for detecting camouflaged objects. We therefore propose an Overlapped Windows Cross-attention mechanism that lets the model focus on matching local information against the reference features. In addition, we propose the Referring Feature Aggregation (RFA) module to progressively decode and segment the camouflaged objects. Extensive experiments on the Ref-COD benchmark demonstrate that our method achieves state-of-the-art performance.
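The abstract does not spell out how the Overlapped Windows Cross-attention is computed, so the following is only a minimal one-dimensional sketch of the general idea, not the authors' implementation. It assumes that queries come from non-overlapping windows of the camouflaged features, while keys and values come from an enlarged window of the reference features that overlaps its neighbours. The function name, the `window` and `overlap` parameters, and the absence of learned projections are all illustrative assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def overlapped_window_cross_attention(cam_feats, ref_feats, window=2, overlap=1):
    """Hypothetical 1-D sketch of overlapped-window cross-attention.

    cam_feats, ref_feats: equal-length lists of feature vectors (lists of floats).
    Queries are taken from non-overlapping windows of cam_feats. Keys and
    values are the raw ref_feats from the same window enlarged by `overlap`
    positions on each side, so adjacent key/value windows overlap.
    No learned projections are applied (an assumption made for brevity).
    """
    assert len(cam_feats) == len(ref_feats)
    n = len(cam_feats)
    dim = len(cam_feats[0])
    scale = 1.0 / math.sqrt(dim)
    out = []
    for start in range(0, n, window):
        end = min(start + window, n)
        # Enlarged reference window, clipped to the sequence bounds.
        k_start = max(0, start - overlap)
        k_end = min(n, end + overlap)
        keys = ref_feats[k_start:k_end]
        for q in cam_feats[start:end]:
            # Scaled dot-product attention restricted to the local window.
            weights = softmax([dot(q, k) * scale for k in keys])
            fused = [sum(w * k[d] for w, k in zip(weights, keys))
                     for d in range(dim)]
            out.append(fused)
    return out
```

Each output position is a weighted average of reference features drawn only from its local, overlapping neighbourhood, which is the locality property the abstract motivates. A real model would operate on 2-D feature maps at several encoder stages and would add learned query, key, and value projections.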
Similar Papers
RefOnce: Distilling References into a Prototype Memory for Referring Camouflaged Object Detection
CV and Pattern Recognition
Find hidden things without needing extra pictures.
Assisted Refinement Network Based on Channel Information Interaction for Camouflaged and Salient Object Detection
CV and Pattern Recognition
Find hidden things in pictures better.
C3Net: Context-Contrast Network for Camouflaged Object Detection
CV and Pattern Recognition
Finds hidden things that blend into backgrounds.