Score: 1

Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models

Published: November 24, 2025 | arXiv ID: 2511.18978v1

By: Santiago Moreno , Pablo Meseguer , Rocío del Amor and more

Potential Business Impact:

Finds skin cancer on tissue pictures.

Business Areas:

Image Recognition Data and Analytics, Software

Accurate annotation of cutaneous neoplasm biopsies represents a major challenge due to their wide morphological variability, overlapping histological patterns, and the subtle distinctions between benign and malignant lesions. Vision-language foundation models (VLMs), pre-trained on paired image-text corpora, learn joint representations that bridge visual features and diagnostic terminology, enabling zero-shot localization and classification of tissue regions without pixel-level labels. However, most existing VLM applications in histopathology remain limited to slide-level tasks or rely on coarse interactive prompts, and they struggle to produce fine-grained segmentations across gigapixel whole-slide images (WSIs). In this work, we introduce a zero-shot visual-language segmentation pipeline for whole-slide images (ZEUS), a fully automated, zero-shot segmentation framework that leverages class-specific textual prompt ensembles and frozen VLM encoders to generate high-resolution tumor masks in WSIs. By partitioning each WSI into overlapping patches, extracting visual embeddings, and computing cosine similarities against text prompts, we generate a final segmentation mask. We demonstrate competitive performance on two in-house datasets, primary spindle cell neoplasms and cutaneous metastases, highlighting the influence of prompt design, domain shifts, and institutional variability in VLMs for histopathology. ZEUS markedly reduces annotation burden while offering scalable, explainable tumor delineation for downstream diagnostic workflows.

Investigating Zero-Shot Diagnostic Pathology in Vision-Language Models with Efficient Prompt Design

CV and Pattern Recognition

Helps doctors find cancer faster using AI pictures.

30 Apr 2025 1

90%

ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models?

CV and Pattern Recognition

Helps doctors diagnose diseases faster with AI.

22 Apr 2025 0

90%

Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation

CV and Pattern Recognition

Lets computers understand pictures better.

15 Sep 2025 1

View PDF Login to Bookmark

Country of Origin

🇪🇸 Spain

Repos / Data Links

github.com

Page Count

4 pages

Zero-shot segmentation of skin tumors in whole-slide images with vision-language foundation models

Finds skin cancer on tissue pictures.

Technical Abstract

Investigating Zero-Shot Diagnostic Pathology in Vision-Language Models with Efficient Prompt Design

ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models?

Synthetic Captions for Open-Vocabulary Zero-Shot Segmentation