ControlFill: Spatially Adjustable Image Inpainting from Prompt Learning

Published: March 6, 2025 | arXiv ID: 2503.04268v1

By: Boseong Jeon

Potential Business Impact:

Lets you add or remove things from pictures easily.

Business Areas:
Visual Search, Internet Services

In this report, I present an inpainting framework named ControlFill, which trains two distinct prompts: one for generating plausible objects within a designated mask (creation) and another for filling the region by extending the background (removal). At inference, these learned embeddings guide a diffusion network, so no heavy text encoder is required. By adjusting the relative weight of the two prompts under classifier-free guidance, users can control the intensity of removal or creation. Furthermore, I introduce a method to vary the guidance intensity spatially by assigning a different scale to each pixel.
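The per-pixel guidance idea can be illustrated with a minimal sketch. It is not the paper's formula: the function name `spatial_guidance`, the array shapes, and the linear blend between the two noise predictions are all assumptions, chosen because they follow the usual classifier-free guidance form with the scalar scale replaced by a per-pixel map.

```python
import numpy as np

def spatial_guidance(eps_creation, eps_removal, scale_map):
    """Blend two noise predictions with a per-pixel guidance scale.

    Hypothetical helper, assumed shapes:
      eps_creation, eps_removal: (C, H, W) noise predictions obtained by
        conditioning the denoiser on the creation / removal embedding.
      scale_map: (H, W) guidance scale per pixel.
    Assumed behaviour: 0 keeps the removal prediction, 1 keeps the
    creation prediction, values above 1 extrapolate toward creation.
    """
    eps_creation = np.asarray(eps_creation, dtype=np.float64)
    eps_removal = np.asarray(eps_removal, dtype=np.float64)
    scale_map = np.asarray(scale_map, dtype=np.float64)

    if eps_creation.shape != eps_removal.shape:
        raise ValueError("noise predictions must share one shape")
    if scale_map.shape != eps_creation.shape[-2:]:
        raise ValueError("scale_map must match the spatial size (H, W)")

    # Broadcast the (H, W) map across the channel axis.
    return eps_removal + scale_map[None, :, :] * (eps_creation - eps_removal)

# Toy inputs: 4 latent channels on a 2x2 grid.
eps_c = np.ones((4, 2, 2))
eps_r = np.zeros((4, 2, 2))
scales = np.array([[0.0, 1.0],
                   [2.0, 0.5]])
guided = spatial_guidance(eps_c, eps_r, scales)
```

In this sketch a user-painted scale map would play the role of `scales`, so that one part of the mask leans toward removal while another leans toward creation within the same denoising step.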

Page Count
12 pages

Category
Computer Science:
CV and Pattern Recognition