Interpretable Single-View 3D Gaussian Splatting using Unsupervised Hierarchical Disentangled Representation Learning
By: Yuyang Zhang, Baao Xie, Hu Zhu and more
Potential Business Impact:
Makes 3D pictures understandable for computers.
3D Gaussian Splatting (3DGS) has recently marked a significant advancement in 3D reconstruction, delivering both rapid rendering and high-quality results. However, existing 3DGS methods make it difficult to understand the underlying 3D semantics, which hinders model controllability and interpretability. To address this, we propose an interpretable single-view 3DGS framework, termed 3DisGS, that discovers both coarse- and fine-grained 3D semantics via hierarchical disentangled representation learning (DRL). Specifically, the model employs a dual-branch architecture, consisting of a point cloud initialization branch and a triplane-Gaussian generation branch, to achieve coarse-grained disentanglement by separating 3D geometry and visual appearance features. Fine-grained semantic representations within each modality are then discovered through DRL-based encoder-adapters. To our knowledge, this is the first work to achieve unsupervised interpretable 3DGS. Evaluations indicate that our model achieves 3D disentanglement while preserving high-quality and rapid reconstruction.
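The two levels of disentanglement described in the abstract can be illustrated with a toy latent traversal. The sketch below is not the 3DisGS architecture: the linear "decoders", the latent sizes, and every name in it (`decode`, `traverse`, `GEO_DIM`, `APP_DIM`, `N_POINTS`) are assumptions made for illustration only. It shows the coarse idea (separate geometry and appearance codes) and the usual way a fine-grained factor is inspected (sweeping one latent dimension while holding the rest fixed).

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for this illustration.
GEO_DIM, APP_DIM = 4, 3   # latent dimensions per modality
N_POINTS = 5              # number of toy Gaussians

# Toy linear maps standing in for the two branches (NOT the paper's networks).
W_geo = rng.normal(size=(GEO_DIM, N_POINTS * 3))  # geometry code -> xyz positions
W_app = rng.normal(size=(APP_DIM, N_POINTS * 3))  # appearance code -> rgb colours


def decode(z_geo, z_app):
    """Decode the two codes independently (coarse-grained separation)."""
    positions = (z_geo @ W_geo).reshape(N_POINTS, 3)
    colors = (z_app @ W_app).reshape(N_POINTS, 3)
    return positions, colors


def traverse(z, dim, values):
    """Return copies of z with one latent dimension swept over `values`."""
    out = np.tile(z, (len(values), 1))
    out[:, dim] = values
    return out


z_geo = rng.normal(size=GEO_DIM)
z_app = rng.normal(size=APP_DIM)
base_pos, base_col = decode(z_geo, z_app)

# Sweep a single geometry factor; the appearance code is held fixed.
sweep = traverse(z_geo, dim=0, values=np.linspace(-2.0, 2.0, 5))
results = [decode(z, z_app) for z in sweep]

colors_unchanged = all(np.allclose(col, base_col) for _, col in results)
positions_changed = any(not np.allclose(pos, base_pos) for pos, _ in results)
print("colors unchanged:", colors_unchanged)
print("positions changed:", positions_changed)
```

In this toy setup the separation holds by construction, because each decoder sees only its own code. In a learned model it would have to be encouraged by the training objective and then checked empirically with traversals of this kind.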
Similar Papers
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
CV and Pattern Recognition
Teaches computers to understand 3D spaces from scans.
Hi-LSplat: Hierarchical 3D Language Gaussian Splatting
CV and Pattern Recognition
Lets computers understand 3D objects from words.
View-Dependent Uncertainty Estimation of 3D Gaussian Splatting
CV and Pattern Recognition
Shows how sure a 3D picture is from any angle.