Score: 1

Beyond Visual Cues: Leveraging General Semantics as Support for Few-Shot Segmentation

Published: November 20, 2025 | arXiv ID: 2511.16435v1

By: Jin Wang , Bingfeng Zhang , Jian Pang and more

Potential Business Impact:

Teaches computers to recognize new things from few examples.

Business Areas:

Image Recognition Data and Analytics, Software

Few-shot segmentation (FSS) aims to segment novel classes under the guidance of limited support samples by a meta-learning paradigm. Existing methods mainly mine references from support images as meta guidance. However, due to intra-class variations among visual representations, the meta information extracted from support images cannot produce accurate guidance to segment untrained classes. In this paper, we argue that the references from support images may not be essential, the key to the support role is to provide unbiased meta guidance for both trained and untrained classes. We then introduce a Language-Driven Attribute Generalization (LDAG) architecture to utilize inherent target property language descriptions to build robust support strategy. Specifically, to obtain an unbiased support representation, we design a Multi-attribute Enhancement (MaE) module, which produces multiple detailed attribute descriptions of the target class through Large Language Models (LLMs), and then builds refined visual-text prior guidance utilizing multi-modal matching. Meanwhile, due to text-vision modal shift, attribute text struggles to promote visual feature representation, we design a Multi-modal Attribute Alignment (MaA) to achieve cross-modal interaction between attribute texts and visual feature. Experiments show that our proposed method outperforms existing approaches by a clear margin and achieves the new state-of-the art performance. The code will be released.

DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation

CV and Pattern Recognition

Teaches computers to recognize new things with few examples.

6 Mar 2025 1

89%

DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model

CV and Pattern Recognition

Teaches computers to recognize new things with few examples.

5 Dec 2025 3

89%

Object-level Correlation for Few-Shot Segmentation

CV and Pattern Recognition

Helps computers find objects with few examples.

9 Sep 2025 1

View PDF Login to Bookmark

Page Count

19 pages

Beyond Visual Cues: Leveraging General Semantics as Support for Few-Shot Segmentation

Teaches computers to recognize new things from few examples.

Technical Abstract

DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation

DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model

Object-level Correlation for Few-Shot Segmentation