Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training
By: Hexiao Lu , Xiaokun Sun , Zeyu Cai and more
Potential Business Impact:
Creates amazing 3D creatures from scratch.
We present Muses, the first training-free method for fantastic 3D creature generation in a feed-forward paradigm. Previous methods, which rely on part-aware optimization, manual assembly, or 2D image generation, often produce unrealistic or incoherent 3D assets due to the challenges of intricate part-level manipulation and limited out-of-domain generation. In contrast, Muses leverages the 3D skeleton, a fundamental representation of biological forms, to explicitly and rationally compose diverse elements. This skeletal foundation formalizes 3D content creation as a structure-aware pipeline of design, composition, and generation. Muses begins by constructing a creatively composed 3D skeleton with coherent layout and scale through graph-constrained reasoning. This skeleton then guides a voxel-based assembly process within a structured latent space, integrating regions from different objects. Finally, image-guided appearance modeling under skeletal conditions is applied to generate a style-consistent and harmonious texture for the assembled shape. Extensive experiments establish Muses' state-of-the-art performance in terms of visual fidelity and alignment with textual descriptions, and potential on flexible 3D object editing. Project page: https://luhexiao.github.io/Muses.github.io/.
Similar Papers
MUSE: Manipulating Unified Framework for Synthesizing Emotions in Images via Test-Time Optimization
CV and Pattern Recognition
Creates images that perfectly match feelings.
Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
Sound
Lets anyone make songs with music and words.
Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
Sound
Makes computers create songs with any style.