Mask Consistency Regularization in Object Removal
By: Hua Yuan , Jin Yuan , Yicheng Jiang and more
Potential Business Impact:
Removes unwanted objects from pictures perfectly.
Object removal, a challenging task within image inpainting, involves seamlessly filling the removed region with content that matches the surrounding context. Despite advancements in diffusion models, current methods still face two critical challenges. The first is mask hallucination, where the model generates irrelevant or spurious content inside the masked region, and the second is mask-shape bias, where the model fills the masked area with an object that mimics the mask's shape rather than surrounding content. To address these issues, we propose Mask Consistency Regularization (MCR), a novel training strategy designed specifically for object removal tasks. During training, our approach introduces two mask perturbations: dilation and reshape, enforcing consistency between the outputs of these perturbed branches and the original mask. The dilated masks help align the model's output with the surrounding content, while reshaped masks encourage the model to break the mask-shape bias. This combination of strategies enables MCR to produce more robust and contextually coherent inpainting results. Our experiments demonstrate that MCR significantly reduces hallucinations and mask-shape bias, leading to improved performance in object removal.
Similar Papers
Promoting Shape Bias in CNNs: Frequency-Based and Contrastive Regularization for Corruption Robustness
CV and Pattern Recognition
Makes computers see objects even when they're blurry.
Abstain Mask Retain Core: Time Series Prediction by Adaptive Masking Loss with Representation Consistency
Machine Learning (CS)
Makes predictions better by ignoring extra noise.
What Shape Is Optimal for Masks in Text Removal?
CV and Pattern Recognition
Cleans text from pictures, even messy ones.