CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
By: Hao He , Ceyuan Yang , Shanchuan Lin and more
Potential Business Impact:
Creates videos of moving scenes from any angle.
This paper introduces CameraCtrl II, a framework that enables large-scale dynamic scene exploration through a camera-controlled video diffusion model. Previous camera-conditioned video generative models suffer from diminished video dynamics and limited range of viewpoints when generating videos with large camera movement. We take an approach that progressively expands the generation of dynamic scenes -- first enhancing dynamic content within individual video clip, then extending this capability to create seamless explorations across broad viewpoint ranges. Specifically, we construct a dataset featuring a large degree of dynamics with camera parameter annotations for training while designing a lightweight camera injection module and training scheme to preserve dynamics of the pretrained models. Building on these improved single-clip techniques, we enable extended scene exploration by allowing users to iteratively specify camera trajectories for generating coherent video sequences. Experiments across diverse scenarios demonstrate that CameraCtrl Ii enables camera-controlled dynamic scene synthesis with substantially wider spatial exploration than previous approaches.
Similar Papers
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
CV and Pattern Recognition
Makes videos follow camera movement perfectly.
CamC2V: Context-aware Controllable Video Generation
CV and Pattern Recognition
Makes videos from pictures with camera movement.
CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control
CV and Pattern Recognition
Creates moving videos from one picture.