Score: 0

SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model

Published: December 11, 2025 | arXiv ID: 2512.10957v1

By: Yukai Shi , Weiyu Li , Zihao Wang and more

Potential Business Impact:

Makes 3D scenes with hidden objects visible.

Business Areas:

Motion Capture Media and Entertainment, Video

We propose a decoupled 3D scene generation framework called SceneMaker in this work. Due to the lack of sufficient open-set de-occlusion and pose estimation priors, existing methods struggle to simultaneously produce high-quality geometry and accurate poses under severe occlusion and open-set settings. To address these issues, we first decouple the de-occlusion model from 3D object generation, and enhance it by leveraging image datasets and collected de-occlusion datasets for much more diverse open-set occlusion patterns. Then, we propose a unified pose estimation model that integrates global and local mechanisms for both self-attention and cross-attention to improve accuracy. Besides, we construct an open-set 3D scene dataset to further extend the generalization of the pose estimation model. Comprehensive experiments demonstrate the superiority of our decoupled framework on both indoor and open-set scenes. Our codes and datasets is released at https://idea-research.github.io/SceneMaker/.

SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation

CV and Pattern Recognition

Moves and changes many objects in pictures easily.

20 Nov 2025 0

87%

SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency

CV and Pattern Recognition

Creates consistent scenes for stories and games.

27 Oct 2025 0

86%

WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion

CV and Pattern Recognition

Helps robots see objects even when they're partly hidden.

19 Nov 2025 0

View PDF Login to Bookmark

Page Count

15 pages

SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model

Makes 3D scenes with hidden objects visible.

Technical Abstract

SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation

SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency

WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion