Score: 2

Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting

Published: December 31, 2025 | arXiv ID: 2512.24702v1

By: Kai Ye , Xiaotong You , Jianghang Lin and more

Potential Business Impact:

Finds objects in pictures by guessing and improving.

Business Areas:

Semantic Search Internet Services

Reasoning Segmentation requires models to interpret complex, context-dependent linguistic queries to achieve pixel-level localization. Current dominant approaches rely heavily on Supervised Fine-Tuning (SFT) or Reinforcement Learning (RL). However, SFT suffers from catastrophic forgetting and domain dependency, while RL is often hindered by training instability and rigid reliance on predefined reward functions. Although recent training-free methods circumvent these training burdens, they are fundamentally limited by a static inference paradigm. These methods typically rely on a single-pass "generate-then-segment" chain, which suffers from insufficient reasoning depth and lacks the capability to self-correct linguistic hallucinations or spatial misinterpretations. In this paper, we challenge these limitations and propose EVOL-SAM3, a novel zero-shot framework that reformulates reasoning segmentation as an inference-time evolutionary search process. Instead of relying on a fixed prompt, EVOL-SAM3 maintains a population of prompt hypotheses and iteratively refines them through a "Generate-Evaluate-Evolve" loop. We introduce a Visual Arena to assess prompt fitness via reference-free pairwise tournaments, and a Semantic Mutation operator to inject diversity and correct semantic errors. Furthermore, a Heterogeneous Arena module integrates geometric priors with semantic reasoning to ensure robust final selection. Extensive experiments demonstrate that EVOL-SAM3 not only substantially outperforms static baselines but also significantly surpasses fully supervised state-of-the-art methods on the challenging ReasonSeg benchmark in a zero-shot setting. The code is available at https://github.com/AHideoKuzeA/Evol-SAM3.

ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning

CV and Pattern Recognition

Helps computers understand moving objects in videos.

2 Dec 2025 1

88%

Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing

CV and Pattern Recognition

Helps computers understand satellite pictures better.

22 Dec 2025 2

88%

Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

CV and Pattern Recognition

Lets computers understand and draw any object.

9 Mar 2025 1

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Repos / Data Links

github.com

Page Count

11 pages

Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting

Finds objects in pictures by guessing and improving.

Technical Abstract

ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning

Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing

Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement