LoG3D: Ultra-High-Resolution 3D Shape Modeling via Local-to-Global Partitioning
By: Xinran Yang , Shuichang Lai , Jiangjing Lyu and more
Potential Business Impact:
Creates detailed 3D shapes from messy data.
Generating high-fidelity 3D contents remains a fundamental challenge due to the complexity of representing arbitrary topologies-such as open surfaces and intricate internal structures-while preserving geometric details. Prevailing methods based on signed distance fields (SDFs) are hampered by costly watertight preprocessing and struggle with non-manifold geometries, while point-cloud representations often suffer from sampling artifacts and surface discontinuities. To overcome these limitations, we propose a novel 3D variational autoencoder (VAE) framework built upon unsigned distance fields (UDFs)-a more robust and computationally efficient representation that naturally handles complex and incomplete shapes. Our core innovation is a local-to-global (LoG) architecture that processes the UDF by partitioning it into uniform subvolumes, termed UBlocks. This architecture couples 3D convolutions for capturing local detail with sparse transformers for enforcing global coherence. A Pad-Average strategy further ensures smooth transitions at subvolume boundaries during reconstruction. This modular design enables seamless scaling to ultra-high resolutions up to 2048^3-a regime previously unattainable for 3D VAEs. Experiments demonstrate state-of-the-art performance in both reconstruction accuracy and generative quality, yielding superior surface smoothness and geometric flexibility.
Similar Papers
Voronoi-Assisted Diffusion for Computing Unsigned Distance Fields from Unoriented Points
CV and Pattern Recognition
Makes 3D shapes from messy dots.
Learning Compact Latent Space for Representing Neural Signed Distance Functions with High-fidelity Geometry Details
CV and Pattern Recognition
Lets computers create detailed 3D shapes from many examples.
UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation
CV and Pattern Recognition
Makes 3D objects from one picture fast.