CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence
By: Tianjiao Yu , Xinzhuo Li , Yifan Shen and more
Potential Business Impact:
Builds 3D worlds from words.
Recent advances in large multimodal models suggest that explicit reasoning mechanisms play a critical role in improving model reliability, interpretability, and cross-modal alignment. While such reasoning-centric approaches have been proven effective in language and vision tasks, their extension to 3D remains underdeveloped. CoRe3D introduces a unified 3D understanding and generation reasoning framework that jointly operates over semantic and spatial abstractions, enabling high-level intent inferred from language to directly guide low-level 3D content formation. Central to this design is a spatially grounded reasoning representation that decomposes 3D latent space into localized regions, allowing the model to reason over geometry in a compositional and procedural manner. By tightly coupling semantic chain-of-thought inference with structured spatial reasoning, CoRe3D produces 3D outputs that exhibit strong local consistency and faithful alignment with linguistic descriptions.
Similar Papers
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
CV and Pattern Recognition
Lets computers imagine 3D shapes from pictures.
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence
CV and Pattern Recognition
Helps computers understand 3D space better.
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence
CV and Pattern Recognition
Teaches computers to understand 3D shapes and space.