Score: 1

DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation

Published: March 6, 2025 | arXiv ID: 2503.04006v1

By: Amin Karimi, Charalambos Poullis

Potential Business Impact:

Teaches computers to recognize new things with few examples.

Business Areas:

Image Recognition Data and Analytics, Software

Few-shot semantic segmentation (FSS) aims to enable models to segment novel/unseen object classes using only a limited number of labeled examples. However, current FSS methods frequently struggle with generalization due to incomplete and biased feature representations, especially when support images do not capture the full appearance variability of the target class. To improve the FSS pipeline, we propose a novel framework that utilizes large language models (LLMs) to adapt general class semantic information to the query image. Furthermore, the framework employs dense pixel-wise matching to identify similarities between query and support images, resulting in enhanced FSS performance. Inspired by reasoning-based segmentation frameworks, our method, named DSV-LFS, introduces an additional token into the LLM vocabulary, allowing a multimodal LLM to generate a "semantic prompt" from class descriptions. In parallel, a dense matching module identifies visual similarities between the query and support images, generating a "visual prompt". These prompts are then jointly employed to guide the prompt-based decoder for accurate segmentation of the query image. Comprehensive experiments on the benchmark datasets Pascal-$5^{i}$ and COCO-$20^{i}$ demonstrate that our framework achieves state-of-the-art performance-by a significant margin-demonstrating superior generalization to novel classes and robustness across diverse scenarios. The source code is available at \href{https://github.com/aminpdik/DSV-LFS}{https://github.com/aminpdik/DSV-LFS}

Beyond Visual Cues: Leveraging General Semantics as Support for Few-Shot Segmentation

CV and Pattern Recognition

Teaches computers to recognize new things from few examples.

20 Nov 2025 1

90%

DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model

CV and Pattern Recognition

Teaches computers to recognize new things with few examples.

5 Dec 2025 3

90%

Overcoming Support Dilution for Robust Few-shot Semantic Segmentation

CV and Pattern Recognition

Helps computers find objects with few examples.

23 Jan 2025 1

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

16 pages

DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation

Teaches computers to recognize new things with few examples.

Technical Abstract

Beyond Visual Cues: Leveraging General Semantics as Support for Few-Shot Segmentation

DistillFSS: Synthesizing Few-Shot Knowledge into a Lightweight Segmentation Model

Overcoming Support Dilution for Robust Few-shot Semantic Segmentation