DGA-Net: Enhancing SAM with Depth Prompting and Graph-Anchor Guidance for Camouflaged Object Detection
By: Yuetong Li, Qing Zhang, Yilin Zhao and more
Potential Business Impact:
Finds hidden things in pictures using depth.
To fully exploit depth cues in Camouflaged Object Detection (COD), we present DGA-Net, a specialized framework that adapts the Segment Anything Model (SAM) via a novel "depth prompting" paradigm. Unlike existing approaches, which rely primarily on sparse prompts (e.g., points or boxes), our method introduces a holistic mechanism for constructing and propagating dense depth prompts. Specifically, we propose a Cross-modal Graph Enhancement (CGE) module that fuses RGB semantics and depth geometry within a heterogeneous graph to form a unified guidance signal. Furthermore, we design an Anchor-Guided Refinement (AGR) module. To counteract the inherent information decay in feature hierarchies, AGR forms a global anchor and establishes direct non-local pathways that broadcast this guidance from deep to shallow layers, ensuring precise and consistent segmentation. Quantitative and qualitative experiments show that DGA-Net outperforms state-of-the-art COD methods.
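The abstract does not give implementation details for AGR, so the following is only a rough illustration of the general idea of broadcasting deep guidance to shallow layers over a non-local (attention-style) pathway. It is not the authors' module. The function names `broadcast_anchor` and `refine_pyramid`, the use of the deepest feature map's tokens as the "anchor", the scaled dot-product attention, and the residual addition are all assumptions made for this sketch.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def broadcast_anchor(shallow, anchor_tokens):
    """Hypothetical non-local pathway (an assumption, not the paper's AGR).

    shallow:       feature map of shape (C, H, W)
    anchor_tokens: anchor tokens of shape (N, C)
    Returns a feature map of shape (C, H, W).
    """
    c, h, w = shallow.shape
    queries = shallow.reshape(c, h * w).T              # (H*W, C)
    scores = queries @ anchor_tokens.T / np.sqrt(c)    # (H*W, N)
    weights = softmax(scores, axis=-1)                 # each row sums to 1
    guidance = weights @ anchor_tokens                 # (H*W, C)
    # Residual addition keeps the original shallow detail.
    return shallow + guidance.T.reshape(c, h, w)

def refine_pyramid(features):
    """features: list of (C, H_i, W_i) arrays ordered shallow to deep.

    The deepest map is flattened into anchor tokens (assumed choice) and
    sent directly to every shallower level, skipping intermediate levels.
    """
    deep = features[-1]
    c = deep.shape[0]
    anchor_tokens = deep.reshape(c, -1).T              # (N, C)
    refined = [broadcast_anchor(f, anchor_tokens) for f in features[:-1]]
    return refined + [deep]

rng = np.random.default_rng(0)
features = [rng.standard_normal((8, s, s)) for s in (16, 8, 4)]
refined = refine_pyramid(features)
```

Because every shallow level attends to the anchor directly, the guidance does not have to pass through intermediate levels, which is one plausible way to read the abstract's claim about avoiding information decay across the hierarchy.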
Similar Papers
SAM-DAQ: Segment Anything Model with Depth-guided Adaptive Queries for RGB-D Video Salient Object Detection
CV and Pattern Recognition
Helps computers find moving objects in videos.
APGNet: Adaptive Prior-Guided for Underwater Camouflaged Object Detection
CV and Pattern Recognition
Finds hidden sea creatures in murky water.
SFGNet: Semantic and Frequency Guided Network for Camouflaged Object Detection
CV and Pattern Recognition
Finds hidden things in pictures better.