SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
By: Byeongjun Park , Hyojun Go , Hyelin Nam and more
Potential Business Impact:
Makes 3D videos look more real and connected.
Recent progress in 3D/4D scene generation emphasizes the importance of physical alignment throughout video generation and scene reconstruction. However, existing methods improve the alignment separately at each stage, making it difficult to manage subtle misalignments arising from another stage. Here, we present SteerX, a zero-shot inference-time steering method that unifies scene reconstruction into the generation process, tilting data distributions toward better geometric alignment. To this end, we introduce two geometric reward functions for 3D/4D scene generation by using pose-free feed-forward scene reconstruction models. Through extensive experiments, we demonstrate the effectiveness of SteerX in improving 3D/4D scene generation.
Similar Papers
SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation
CV and Pattern Recognition
Lets cameras track moving people and animals.
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
CV and Pattern Recognition
Turns regular videos into 3D moving worlds.
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
CV and Pattern Recognition
Makes one picture move and change like a video.