DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF
By: Doriand Petit , Steve Bourgeois , Vincent Gay-Bellile and more
Potential Business Impact:
Lets robots understand objects they've never seen.
3D semantic segmentation provides high-level scene understanding for applications in robotics, autonomous systems, \textit{etc}. Traditional methods adapt exclusively to either task-specific goals (open-vocabulary segmentation) or scene content (unsupervised semantic segmentation). We propose DiSCO-3D, the first method addressing the broader problem of 3D Open-Vocabulary Sub-concepts Discovery, which aims to provide a 3D semantic segmentation that adapts to both the scene and user queries. We build DiSCO-3D on Neural Fields representations, combining unsupervised segmentation with weak open-vocabulary guidance. Our evaluations demonstrate that DiSCO-3D achieves effective performance in Open-Vocabulary Sub-concepts Discovery and exhibits state-of-the-art results in the edge cases of both open-vocabulary and unsupervised segmentation.
Similar Papers
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation
CV and Pattern Recognition
Helps robots see and name objects in 3D.
OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation
CV and Pattern Recognition
Lets robots understand and find any object.
Open-Vocabulary Semantic Part Segmentation of 3D Human
CV and Pattern Recognition
Lets computers understand 3D people's body parts.