Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning
By: Yang Li , Aming Wu , Zihao Zhang and more
Potential Business Impact:
Teaches computers to identify new things in 3D scans.
In this paper, we focus on Novel Class Discovery for Point Cloud Segmentation (3D-NCD), aiming to learn a model that can segment unlabeled (novel) 3D classes using only the supervision from labeled (base) 3D classes. The key to this task is to setup the exact correlations between the point representations and their base class labels, as well as the representation correlations between the points from base and novel classes. A coarse or statistical correlation learning may lead to the confusion in novel class inference. lf we impose a causal relationship as a strong correlated constraint upon the learning process, the essential point cloud representations that accurately correspond to the classes should be uncovered. To this end, we introduce a structural causal model (SCM) to re-formalize the 3D-NCD problem and propose a new method, i.e., Joint Learning of Causal Representation and Reasoning. Specifically, we first analyze hidden confounders in the base class representations and the causal relationships between the base and novel classes through SCM. We devise a causal representation prototype that eliminates confounders to capture the causal representations of base classes. A graph structure is then used to model the causal relationships between the base classes' causal representation prototypes and the novel class prototypes, enabling causal reasoning from base to novel classes. Extensive experiments and visualization results on 3D and 2D NCD semantic segmentation demonstrate the superiorities of our method.
Similar Papers
VLM-NCD:Novel Class Discovery with Vision-Based Large Language Models
CV and Pattern Recognition
Finds new things in pictures using words.
NeurNCD: Novel Class Discovery via Implicit Neural Representation
Machine Learning (CS)
Helps computers find new things in pictures.
Generalized Zero-Shot Learning for Point Cloud Segmentation with Evidence-Based Dynamic Calibration
CV and Pattern Recognition
Helps computers identify objects they haven't seen.