Prompt-based Consistent Video Colorization
By: Silvia Dani, Tiberio Uricchio, Lorenzo Seidenari
Potential Business Impact:
Makes old black and white movies colorful automatically.
Existing video colorization methods struggle with temporal flickering or demand extensive manual input. We propose a novel approach automating high-fidelity video colorization using rich semantic guidance derived from language and segmentation. We employ a language-conditioned diffusion model to colorize grayscale frames. Guidance is provided via automatically generated object masks and textual prompts; our primary automatic method uses a generic prompt, achieving state-of-the-art results without specific color input. Temporal stability is achieved by warping color information from previous frames using optical flow (RAFT); a correction step detects and fixes inconsistencies introduced by warping. Evaluations on standard benchmarks (DAVIS30, VIDEVO20) show our method achieves state-of-the-art performance in colorization accuracy (PSNR) and visual realism (Colorfulness, CDC), demonstrating the efficacy of automated prompt-based guidance for consistent video colorization.
Similar Papers
Progressive Image Restoration via Text-Conditioned Video Generation
CV and Pattern Recognition
Fixes blurry, dark, or low-quality pictures.
Automatic Image Colorization with Convolutional Neural Networks and Generative Adversarial Networks
CV and Pattern Recognition
Makes old black and white photos colorful again.
Automatic Image Colorization with Convolutional Neural Networks and Generative Adversarial Networks
CV and Pattern Recognition
Makes old black and white photos colorful again.