IN2OUT: Fine-Tuning Video Inpainting Model for Video Outpainting Using Hierarchical Discriminator
By: Sangwoo Youn , Minji Lee , Nokap Tony Park and more
Potential Business Impact:
Makes videos look bigger without looking weird.
Video outpainting presents a unique challenge of extending the borders while maintaining consistency with the given content. In this paper, we suggest the use of video inpainting models that excel in object flow learning and reconstruction in outpainting rather than solely generating the background as in existing methods. However, directly applying or fine-tuning inpainting models to outpainting has shown to be ineffective, often leading to blurry results. Our extensive experiments on discriminator designs reveal that a critical component missing in the outpainting fine-tuning process is a discriminator capable of effectively assessing the perceptual quality of the extended areas. To tackle this limitation, we differentiate the objectives of adversarial training into global and local goals and introduce a hierarchical discriminator that meets both objectives. Additionally, we develop a specialized outpainting loss function that leverages both local and global features of the discriminator. Fine-tuning on this adversarial loss function enhances the generator's ability to produce both visually appealing and globally coherent outpainted scenes. Our proposed method outperforms state-of-the-art methods both quantitatively and qualitatively. Supplementary materials including the demo video and the code are available in SigPort.
Similar Papers
Unified Long Video Inpainting and Outpainting via Overlapping High-Order Co-Denoising
CV and Pattern Recognition
Makes videos longer and edits them perfectly.
From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition
CV and Pattern Recognition
Lets you edit parts of a picture separately.
FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting
CV and Pattern Recognition
Makes AI fill in missing picture parts from your words.