RealCamo: Boosting Real Camouflage Synthesis with Layout Controls and Textual-Visual Guidance
By: Chunyuan Chen , Yunuo Cai , Shujuan Li and more
Potential Business Impact:
Creates realistic fake images for training AI.
Camouflaged image generation (CIG) has recently emerged as an efficient alternative for acquiring high-quality training data for camouflaged object detection (COD). However, existing CIG methods still suffer from a substantial gap to real camouflaged imagery: generated images either lack sufficient camouflage due to weak visual similarity, or exhibit cluttered backgrounds that are semantically inconsistent with foreground targets. To address these limitations, we propose ReamCamo, a unified out-painting based framework for realistic camouflaged image generation. ReamCamo explicitly introduces additional layout controls to regulate global image structure, thereby improving semantic coherence between foreground objects and generated backgrounds. Moreover, we construct a multi-modal textual-visual condition by combining a unified fine-grained textual task description with texture-oriented background retrieval, which jointly guides the generation process to enhance visual fidelity and realism. To quantitatively assess camouflage quality, we further introduce a background-foreground distribution divergence metric that measures the effectiveness of camouflage in generated images. Extensive experiments and visualizations demonstrate the effectiveness of our proposed framework.
Similar Papers
Foreground Focus: Enhancing Coherence and Fidelity in Camouflaged Image Generation
CV and Pattern Recognition
Makes fake pictures look real for computers.
C3Net: Context-Contrast Network for Camouflaged Object Detection
CV and Pattern Recognition
Finds hidden things that blend into backgrounds.
Zero-shot Synthetic Video Realism Enhancement via Structure-aware Denoising
CV and Pattern Recognition
Makes fake videos look like real life.