Score: 0

More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery

Published: December 8, 2025 | arXiv ID: 2512.07596v1

By: Wenzhen Dong , Jieming Yu , Yiming Huang and more

Potential Business Impact:

Helps robots see and build body parts in surgery.

Business Areas:

Image Recognition Data and Analytics, Software

The recent Segment Anything Model (SAM) 3 has introduced significant advancements over its predecessor, SAM 2, particularly with the integration of language-based segmentation and enhanced 3D perception capabilities. SAM 3 supports zero-shot segmentation across a wide range of prompts, including point, bounding box, and language-based prompts, allowing for more flexible and intuitive interactions with the model. In this empirical evaluation, we assess the performance of SAM 3 in robot-assisted surgery, benchmarking its zero-shot segmentation with point and bounding box prompts and exploring its effectiveness in dynamic video tracking, alongside its newly introduced language prompt segmentation. While language prompts show potential, their performance in the surgical domain is currently suboptimal, highlighting the need for further domain-specific training. Additionally, we investigate SAM 3's 3D reconstruction abilities, demonstrating its capacity to process surgical scene data and reconstruct 3D anatomical structures from 2D images. Through comprehensive testing on the MICCAI EndoVis 2017 and EndoVis 2018 benchmarks, SAM 3 shows clear improvements over SAM and SAM 2 in both image and video segmentation under spatial prompts, while zero-shot evaluations on SCARED, StereoMIS, and EndoNeRF indicate strong monocular depth estimation and realistic 3D instrument reconstruction, yet also reveal remaining limitations in complex, highly dynamic surgical scenes.

More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery

CV and Pattern Recognition

Helps robots see and understand surgery in 3D.

8 Dec 2025 0

92%

MedSAM3: Delving into Segment Anything with Medical Concepts

CV and Pattern Recognition

Lets doctors find body parts in scans with words.

24 Nov 2025 1

92%

Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation

CV and Pattern Recognition

Helps doctors find body parts in medical scans.

15 Jan 2026 1

View PDF Login to Bookmark

Page Count

11 pages

More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery

Helps robots see and build body parts in surgery.

Technical Abstract

More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery

MedSAM3: Delving into Segment Anything with Medical Concepts

Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation