Score: 0

Flux4D: Flow-based Unsupervised 4D Reconstruction

Published: December 2, 2025 | arXiv ID: 2512.03210v1

By: Jingkang Wang , Henry Che , Yun Chen and more

Potential Business Impact:

Builds 3D worlds from videos in seconds.

Business Areas:

Image Recognition Data and Analytics, Software

Reconstructing large-scale dynamic scenes from visual observations is a fundamental challenge in computer vision, with critical implications for robotics and autonomous systems. While recent differentiable rendering methods such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS) have achieved impressive photorealistic reconstruction, they suffer from scalability limitations and require annotations to decouple actor motion. Existing self-supervised methods attempt to eliminate explicit annotations by leveraging motion cues and geometric priors, yet they remain constrained by per-scene optimization and sensitivity to hyperparameter tuning. In this paper, we introduce Flux4D, a simple and scalable framework for 4D reconstruction of large-scale dynamic scenes. Flux4D directly predicts 3D Gaussians and their motion dynamics to reconstruct sensor observations in a fully unsupervised manner. By adopting only photometric losses and enforcing an "as static as possible" regularization, Flux4D learns to decompose dynamic elements directly from raw data without requiring pre-trained supervised models or foundational priors simply by training across many scenes. Our approach enables efficient reconstruction of dynamic scenes within seconds, scales effectively to large datasets, and generalizes well to unseen environments, including rare and unknown objects. Experiments on outdoor driving datasets show Flux4D significantly outperforms existing methods in scalability, generalization, and reconstruction quality.

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

CV and Pattern Recognition

Lets self-driving cars see and remember 3D scenes.

2 Dec 2025 2

90%

4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar

CV and Pattern Recognition

Helps self-driving cars see moving objects better.

16 Sep 2025 1

90%

Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding

CV and Pattern Recognition

Makes videos show 3D worlds without flickering.

3 Dec 2025 0

View PDF Login to Bookmark

Page Count

15 pages

Flux4D: Flow-based Unsupervised 4D Reconstruction

Builds 3D worlds from videos in seconds.

Technical Abstract

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

4DRadar-GS: Self-Supervised Dynamic Driving Scene Reconstruction with 4D Radar

Motion4D: Learning 3D-Consistent Motion and Semantics for 4D Scene Understanding