DivAS: Interactive 3D Segmentation of NeRFs via Depth-Weighted Voxel Aggregation
By: Ayush Pande
Potential Business Impact:
Makes 3D scenes understandable instantly.
Existing methods for segmenting Neural Radiance Fields (NeRFs) are typically optimization-based, requiring slow per-scene training that sacrifices the zero-shot capabilities of 2D foundation models. We introduce DivAS (Depth-interactive Voxel Aggregation Segmentation), an optimization-free, fully interactive framework that addresses these limitations. Our method operates via a fast GUI-based workflow in which 2D SAM masks, generated from user point prompts, are refined with NeRF-derived depth priors to improve geometric accuracy and foreground-background separation. The core of our contribution is a custom CUDA kernel that aggregates these refined multi-view masks into a unified 3D voxel grid in under 200 ms, enabling real-time visual feedback without any per-scene training. Experiments on Mip-NeRF 360 and LLFF show that DivAS achieves segmentation quality comparable to optimization-based methods while being 2-2.5x faster end-to-end, and up to an order of magnitude faster when user prompting time is excluded.
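The abstract does not show the aggregation kernel itself, but the idea can be illustrated. Below is a minimal CUDA sketch of depth-weighted multi-view mask aggregation into a voxel grid, assuming each view provides a 3x4 world-to-pixel projection matrix, a refined SAM mask, and a NeRF depth map. The kernel name (aggregate_masks), the DEPTH_TOL tolerance, and the simple per-view voting scheme are illustrative assumptions, not the authors' actual implementation.

```cuda
// Hypothetical sketch of depth-weighted multi-view mask aggregation,
// in the spirit of DivAS. All names and the voting scheme are assumed
// for illustration; this is not the paper's kernel.
#include <cuda_runtime.h>

// Assumed tolerance (in scene units) for agreeing with the NeRF depth prior.
#define DEPTH_TOL 0.02f

// One thread per voxel: project the voxel center into every view and
// accumulate a vote when the pixel lies inside the refined SAM mask and
// the voxel sits near the NeRF-rendered depth (i.e., is not occluded).
__global__ void aggregate_masks(
    const float* __restrict__ proj,          // [V][12] row-major 3x4 world->pixel matrices
    const unsigned char* __restrict__ masks, // [V][H][W] refined SAM masks (0/1)
    const float* __restrict__ depths,        // [V][H][W] NeRF depth maps
    float* __restrict__ votes,               // [Nx*Ny*Nz] accumulated vote weights
    int V, int H, int W,
    int Nx, int Ny, int Nz,
    float3 origin, float voxel_size)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    int n_vox = Nx * Ny * Nz;
    if (idx >= n_vox) return;

    // Recover (ix, iy, iz) from the flat index and compute the voxel center.
    int iz = idx % Nz;
    int iy = (idx / Nz) % Ny;
    int ix = idx / (Nz * Ny);
    float3 p = make_float3(origin.x + (ix + 0.5f) * voxel_size,
                           origin.y + (iy + 0.5f) * voxel_size,
                           origin.z + (iz + 0.5f) * voxel_size);

    float acc = 0.0f;
    for (int v = 0; v < V; ++v) {
        const float* P = proj + 12 * v;
        // Homogeneous projection: (u*z, r*z, z) = P * [p; 1].
        float zu = P[0]*p.x + P[1]*p.y + P[2]*p.z  + P[3];
        float zr = P[4]*p.x + P[5]*p.y + P[6]*p.z  + P[7];
        float z  = P[8]*p.x + P[9]*p.y + P[10]*p.z + P[11];
        if (z <= 0.0f) continue;                 // behind the camera
        int u = (int)(zu / z);                   // pixel column
        int r = (int)(zr / z);                   // pixel row
        if (u < 0 || u >= W || r < 0 || r >= H) continue;

        int pix = (v * H + r) * W + u;
        // Depth consistency: only the first visible surface may vote.
        if (fabsf(depths[pix] - z) > DEPTH_TOL) continue;
        if (masks[pix]) acc += 1.0f;             // mask vote from this view
    }
    votes[idx] = acc;  // threshold on the host, e.g. votes >= V / 2
}
```

A host would launch this with, say, `aggregate_masks<<<(n_vox + 255) / 256, 256>>>(...)` and threshold the vote grid (for instance at half the number of views) to obtain the binary 3D segmentation; the sub-200 ms figure quoted in the abstract refers to the authors' kernel, not to this sketch.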
Similar Papers
UniC-Lift: Unified 3D Instance Segmentation via Contrastive Learning
CV and Pattern Recognition
Makes 3D pictures understand objects better.
ViSNeRF: Efficient Multidimensional Neural Radiance Field Representation for Visualization Synthesis of Dynamic Volumetric Scenes
Graphics
Creates new science pictures from few examples.
Joint Learning of Depth, Pose, and Local Radiance Field for Large Scale Monocular 3D Reconstruction
CV and Pattern Recognition
Creates 3D worlds from one camera.