Score: 1

LASER: Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction

Published: December 15, 2025 | arXiv ID: 2512.13680v1

By: Tianye Ding, Yiming Xie, Yiqing Liang, and more

Potential Business Impact:

Lets existing offline 3D reconstruction models run on live, streaming video without any retraining, making long-video 3D reconstruction cheaper and practical to deploy.

Business Areas:
Image Recognition, Data and Analytics, Software

Recent feed-forward reconstruction models like VGGT and $\pi^3$ achieve impressive reconstruction quality but cannot process streaming videos due to quadratic memory complexity, limiting their practical deployment. While existing streaming methods address this through learned memory mechanisms or causal attention, they require extensive retraining and may not fully leverage the strong geometric priors of state-of-the-art offline models. We propose LASER, a training-free framework that converts an offline reconstruction model into a streaming system by aligning predictions across consecutive temporal windows. We observe that simple similarity transformation ($\mathrm{Sim}(3)$) alignment fails due to layer depth misalignment: monocular scale ambiguity causes the relative depth scales of different scene layers to vary inconsistently between windows. To address this, we introduce layer-wise scale alignment, which segments depth predictions into discrete layers, computes per-layer scale factors, and propagates them across both adjacent windows and timestamps. Extensive experiments show that LASER achieves state-of-the-art performance on camera pose estimation and point map reconstruction with offline models while operating at 14 FPS with 6 GB peak memory on an RTX A6000 GPU, enabling practical deployment for kilometer-scale streaming videos. Project website: $\href{https://neu-vi.github.io/LASER/}{\texttt{https://neu-vi.github.io/LASER/}}$
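
The abstract's core idea, rescaling each depth layer independently between overlapping windows instead of fitting a single global $\mathrm{Sim}(3)$, can be sketched roughly as below. This is a minimal illustrative sketch, not the paper's implementation: the function name, the quantile-based layer segmentation, and the median-ratio scale estimate are assumptions; LASER's actual layer segmentation and its propagation of scales across windows and timestamps are described in the paper itself.

```python
import numpy as np

def layerwise_scale_align(depth_prev, depth_new, num_layers=8, eps=1e-6):
    """Illustrative sketch: align a new window's depth prediction to the
    previous window using per-layer scale factors on an overlapping frame.

    depth_prev, depth_new: (H, W) depth maps predicted for the same
    overlapping frame by the previous and current temporal windows
    (hypothetical inputs for this sketch).
    """
    # Segment the new window's depth into discrete layers via quantile bins
    # (a simple stand-in for the paper's layer segmentation).
    edges = np.quantile(depth_new, np.linspace(0.0, 1.0, num_layers + 1))
    aligned = depth_new.copy()
    scales = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (depth_new >= lo) & (depth_new <= hi)
        if not mask.any():
            scales.append(1.0)
            continue
        # Per-layer scale factor: ratio of median depths over this layer's
        # pixels, so each scene layer is rescaled independently rather than
        # with one global scale.
        s = np.median(depth_prev[mask]) / (np.median(depth_new[mask]) + eps)
        aligned[mask] *= s
        scales.append(float(s))
    return aligned, scales
```

In a streaming loop, the `aligned` depth would presumably replace the new window's raw prediction before merging point maps, with the per-layer scales carried forward to later timestamps; how exactly those scales are propagated is part of the method and is not reproduced here.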

Page Count
16 pages

Category
Computer Science:
CV and Pattern Recognition