DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion
By: Geunmin Hwang , Hyun-kyu Ko , Younghyun Kim and more
Potential Business Impact:
Makes slow videos look super smooth and fast.
Recent advancements in diffusion models have revolutionized video generation, enabling the creation of high-quality, temporally consistent videos. However, generating high frame-rate (FPS) videos remains a significant challenge due to issues such as flickering and degradation in long sequences, particularly in fast-motion scenarios. Existing methods often suffer from computational inefficiencies and limitations in maintaining video quality over extended frames. In this paper, we present a novel, training-free approach for high FPS video generation using pre-trained diffusion models. Our method, DiffuseSlide, introduces a new pipeline that leverages key frames from low FPS videos and applies innovative techniques, including noise re-injection and sliding window latent denoising, to achieve smooth, consistent video outputs without the need for additional fine-tuning. Through extensive experiments, we demonstrate that our approach significantly improves video quality, offering enhanced temporal coherence and spatial fidelity. The proposed method is not only computationally efficient but also adaptable to various video generation tasks, making it ideal for applications such as virtual reality, video games, and high-quality content creation.
Similar Papers
TPDiff: Temporal Pyramid Video Diffusion Model
CV and Pattern Recognition
Makes video creation faster and cheaper.
Hierarchical Flow Diffusion for Efficient Frame Interpolation
CV and Pattern Recognition
Makes videos smoother and faster to create.
Fitting Image Diffusion Models on Video Datasets
CV and Pattern Recognition
Makes AI create videos that look more real.