MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
By: Zhixuan Liu , Haokun Zhu , Rui Chen and more
Potential Business Impact:
Creates private 3D copies of rooms from pictures.
We introduce a novel diffusion-based approach for generating privacy-preserving digital twins of multi-room indoor environments from depth images only. Central to our approach is a novel Multi-view Overlapped Scene Alignment with Implicit Consistency (MOSAIC) model that explicitly considers cross-view dependencies within the same scene in the probabilistic sense. MOSAIC operates through a novel inference-time optimization that avoids error accumulation common in sequential or single-room constraint in panorama-based approaches. MOSAIC scales to complex scenes with zero extra training and provably reduces the variance during denoising processes when more overlapping views are added, leading to improved generation quality. Experiments show that MOSAIC outperforms state-of-the-art baselines on image fidelity metrics in reconstructing complex multi-room environments. Project page is available at: https://mosaic-cmubig.github.io
Similar Papers
MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement
CV and Pattern Recognition
Creates realistic pictures with many people.
MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations
Computation and Language
Simulates online behavior to fight fake news.
MOSAIC: A Skill-Centric Algorithmic Framework for Long-Horizon Manipulation Planning
Robotics
Robots learn to do many tasks by combining simple moves.