OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
By: Yukun Huang , Jiwen Yu , Yanning Zhou and more
Potential Business Impact:
Creates realistic 3D worlds from 2D pictures.
There are two prevalent ways to constructing 3D scenes: procedural generation and 2D lifting. Among them, panorama-based 2D lifting has emerged as a promising technique, leveraging powerful 2D generative priors to produce immersive, realistic, and diverse 3D environments. In this work, we advance this technique to generate graphics-ready 3D scenes suitable for physically based rendering (PBR), relighting, and simulation. Our key insight is to repurpose 2D generative models for panoramic perception of geometry, textures, and PBR materials. Unlike existing 2D lifting approaches that emphasize appearance generation and ignore the perception of intrinsic properties, we present OmniX, a versatile and unified framework. Based on a lightweight and efficient cross-modal adapter structure, OmniX reuses 2D generative priors for a broad range of panoramic vision tasks, including panoramic perception, generation, and completion. Furthermore, we construct a large-scale synthetic panorama dataset containing high-quality multimodal panoramas from diverse indoor and outdoor scenes. Extensive experiments demonstrate the effectiveness of our model in panoramic visual perception and graphics-ready 3D scene generation, opening new possibilities for immersive and physically realistic virtual world generation.
Similar Papers
Matrix-3D: Omnidirectional Explorable 3D World Generation
CV and Pattern Recognition
Creates 3D worlds from one picture or words.
Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images
CV and Pattern Recognition
Builds 3D worlds from many pictures.
TiP4GEN: Text to Immersive Panorama 4D Scene Generation
CV and Pattern Recognition
Creates 360-degree moving virtual worlds from text.