
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors

Published: March 17, 2025 | arXiv ID: 2503.13272v1

By: Katja Schwarz, Norman Mueller, Peter Kontschieder

Potential Business Impact:

Creates realistic 3D worlds from flat images.

Business Areas:

3D Technology, Hardware, Software

Synthesizing consistent and photorealistic 3D scenes is an open problem in computer vision. Video diffusion models generate impressive videos but cannot directly synthesize 3D representations; that is, the generated sequences lack 3D consistency. In addition, directly training generative 3D models is challenging because 3D training data is not available at scale. In this work, we present Generative Gaussian Splatting (GGS), a novel approach that integrates a 3D representation with a pre-trained latent video diffusion model. Specifically, our model synthesizes a feature field parameterized via 3D Gaussian primitives. The feature field is then either rendered to feature maps and decoded into multi-view images, or directly upsampled into a 3D radiance field. We evaluate our approach on two common benchmark datasets for scene synthesis, RealEstate10K and ScanNet+, and find that our proposed GGS model significantly improves both the 3D consistency of the generated multi-view images and the quality of the generated 3D scenes over all relevant baselines. Compared to a similar model without 3D representation, GGS improves FID on the generated 3D scenes by ~20% on both RealEstate10K and ScanNet+. Project page: https://katjaschwarz.github.io/ggs/
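
The abstract's rendering step, in which a feature field made of 3D Gaussian primitives is rendered to feature maps, can be illustrated with a toy sketch. This is not the authors' implementation. It assumes isotropic Gaussians given in camera space, a pinhole camera with a hypothetical `focal` parameter, and plain front-to-back alpha compositing; the function name `render_feature_map` and all of its parameters are illustrative.

```python
import numpy as np


def render_feature_map(means, feats, opacities, scales, focal, height, width):
    """Splat isotropic 3D Gaussians that carry feature vectors into a 2D map.

    Illustrative assumptions (not taken from the paper):
      means:     (N, 3) Gaussian centers in camera space, z > 0 is visible
      feats:     (N, C) feature vector attached to each Gaussian
      opacities: (N,)   peak opacity of each Gaussian, in [0, 1]
      scales:    (N,)   isotropic standard deviation in scene units
    Returns the (H, W, C) feature map and the (H, W) accumulated alpha.
    """
    channels = feats.shape[1]
    ys, xs = np.mgrid[0:height, 0:width].astype(np.float64)
    cx, cy = (width - 1) / 2.0, (height - 1) / 2.0  # principal point

    feature_map = np.zeros((height, width, channels))
    transmittance = np.ones((height, width))  # light not yet absorbed

    # Composite front to back: nearest Gaussians are blended first.
    for i in np.argsort(means[:, 2]):
        x, y, z = means[i]
        if z <= 0:
            continue  # behind the camera
        u = focal * x / z + cx            # projected center, pixels
        v = focal * y / z + cy
        sigma = focal * scales[i] / z     # projected std dev, pixels
        falloff = np.exp(-((xs - u) ** 2 + (ys - v) ** 2) / (2 * sigma**2))
        alpha = np.clip(opacities[i] * falloff, 0.0, 0.999)
        feature_map += (transmittance * alpha)[..., None] * feats[i]
        transmittance *= 1.0 - alpha

    return feature_map, 1.0 - transmittance


if __name__ == "__main__":
    # Two Gaussians on the optical axis; the far one is listed first.
    means = np.array([[0.0, 0.0, 2.0], [0.0, 0.0, 1.0]])
    feats = np.array([[0.0, 1.0], [1.0, 0.0]])
    fmap, acc = render_feature_map(
        means, feats, np.array([1.0, 0.5]), np.array([0.1, 0.1]),
        focal=10.0, height=5, width=5,
    )
    print(fmap[2, 2], acc[2, 2])
```

In a pipeline like the one the abstract describes, a map of this kind would then be passed to a decoder to produce images; that decoder is outside the scope of this sketch.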

Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis

CV and Pattern Recognition

Creates realistic 3D worlds from few pictures.

2 Apr 2025

92%

Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs

CV and Pattern Recognition

Makes 3D pictures from few photos.

7 Mar 2025

92%

GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation

CV and Pattern Recognition

Creates 3D objects from pictures, making them look real.

8 Mar 2025


Page Count

18 pages

