ST-SAM: SAM-Driven Self-Training Framework for Semi-Supervised Camouflaged Object Detection
By: Xihang Hu , Fuming Sun , Jiazhe Liu and more
Potential Business Impact:
Teaches computers to find hidden things with less help.
Semi-supervised Camouflaged Object Detection (SSCOD) aims to reduce reliance on costly pixel-level annotations by leveraging limited annotated data and abundant unlabeled data. However, existing SSCOD methods based on Teacher-Student frameworks suffer from severe prediction bias and error propagation under scarce supervision, while their multi-network architectures incur high computational overhead and limited scalability. To overcome these limitations, we propose ST-SAM, a highly annotation-efficient yet concise framework that breaks away from conventional SSCOD constraints. Specifically, ST-SAM employs Self-Training strategy that dynamically filters and expands high-confidence pseudo-labels to enhance a single-model architecture, thereby fundamentally circumventing inter-model prediction bias. Furthermore, by transforming pseudo-labels into hybrid prompts containing domain-specific knowledge, ST-SAM effectively harnesses the Segment Anything Model's potential for specialized tasks to mitigate error accumulation in self-training. Experiments on COD benchmark datasets demonstrate that ST-SAM achieves state-of-the-art performance with only 1\% labeled data, outperforming existing SSCOD methods and even matching fully supervised methods. Remarkably, ST-SAM requires training only a single network, without relying on specific models or loss functions. This work establishes a new paradigm for annotation-efficient SSCOD. Codes will be available at https://github.com/hu-xh/ST-SAM.
Similar Papers
SAM-TTT: Segment Anything Model via Reverse Parameter Configuration and Test-Time Training for Camouflaged Object Detection
CV and Pattern Recognition
Helps computers find hidden things in pictures.
Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance
CV and Pattern Recognition
Finds hidden things in pictures better.
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters
CV and Pattern Recognition
Helps computers find hidden things in pictures.