Score: 0

Text To 3D Object Generation For Scalable Room Assembly

Published: April 12, 2025 | arXiv ID: 2504.09328v1

By: Sonia Laguna , Alberto Garcia-Garcia , Marie-Julie Rakotosaona and more

Potential Business Impact:

Creates realistic 3D scenes from text for AI.

Business Areas:
Text Analytics Data and Analytics, Software

Modern machine learning models for scene understanding, such as depth estimation and object tracking, rely on large, high-quality datasets that mimic real-world deployment scenarios. To address data scarcity, we propose an end-to-end system for synthetic data generation for scalable, high-quality, and customizable 3D indoor scenes. By integrating and adapting text-to-image and multi-view diffusion models with Neural Radiance Field-based meshing, this system generates highfidelity 3D object assets from text prompts and incorporates them into pre-defined floor plans using a rendering tool. By introducing novel loss functions and training strategies into existing methods, the system supports on-demand scene generation, aiming to alleviate the scarcity of current available data, generally manually crafted by artists. This system advances the role of synthetic data in addressing machine learning training limitations, enabling more robust and generalizable models for real-world applications.

Country of Origin
🇨🇭 Switzerland

Page Count
15 pages

Category
Computer Science:
CV and Pattern Recognition