Score: 2

ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points

Published: December 8, 2025 | arXiv ID: 2512.07504v1

By: Ryota Okumura, Kaede Shiohara, Toshihiko Yamasaki

Potential Business Impact:

Fixes wonky lines in AI pictures.

Business Areas:
Image Recognition, Data and Analytics, Software

Recent text-to-image models, such as Stable Diffusion, have achieved impressive visual quality, yet they often suffer from geometric inconsistencies that undermine the structural realism of generated scenes. One prominent issue is vanishing point inconsistency, where projections of parallel lines fail to converge correctly in 2D space. This leads to structurally implausible geometry that degrades spatial realism, especially in architectural scenes. We propose ControlVP, a user-guided framework for correcting vanishing point inconsistencies in generated images. Our approach extends a pre-trained diffusion model by incorporating structural guidance derived from building contours. We also introduce geometric constraints that explicitly encourage alignment between image edges and perspective cues. Our method enhances global geometric consistency while maintaining visual fidelity comparable to the baselines. This capability is particularly valuable for applications that require accurate spatial structure, such as image-to-3D reconstruction. The dataset and source code are available at https://github.com/RyotaOkumura/ControlVP .
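
As an illustration of what "vanishing point inconsistency" means, the sketch below estimates a vanishing point from two image line segments and measures how far a third segment deviates from it. This is not the authors' method or code; it is a minimal geometric example that assumes line segments have already been detected, and every function name here (`line_through`, `intersect`, `angular_error_deg`) is hypothetical.

```python
import math


def line_through(p, q):
    """Homogeneous line (a, b, c) through image points p and q.

    This is the cross product of (x1, y1, 1) and (x2, y2, 1).
    """
    (x1, y1), (x2, y2) = p, q
    return (y1 - y2, x2 - x1, x1 * y2 - x2 * y1)


def intersect(l1, l2, eps=1e-12):
    """Intersection of two homogeneous lines, or None if they are parallel in the image."""
    a1, b1, c1 = l1
    a2, b2, c2 = l2
    x = b1 * c2 - b2 * c1
    y = c1 * a2 - c2 * a1
    w = a1 * b2 - a2 * b1
    if abs(w) < eps:
        return None  # the vanishing point lies at infinity
    return (x / w, y / w)


def angular_error_deg(segment, vp):
    """Angle in degrees between a segment and the ray from its midpoint to vp."""
    p, q = segment
    mx, my = (p[0] + q[0]) / 2, (p[1] + q[1]) / 2
    dx, dy = q[0] - p[0], q[1] - p[1]
    vx, vy = vp[0] - mx, vp[1] - my
    cos = abs(dx * vx + dy * vy) / (math.hypot(dx, dy) * math.hypot(vx, vy))
    return math.degrees(math.acos(min(1.0, cos)))


# Two segments that are assumed to be images of parallel 3D edges.
seg_a = ((0.0, 1.0), (5.0, 0.5))
seg_b = ((0.0, -1.0), (5.0, -0.5))
vp = intersect(line_through(*seg_a), line_through(*seg_b))

# A third segment that should share the same vanishing point but does not.
seg_c = ((0.0, 0.0), (1.0, 1.0))
error_consistent = angular_error_deg(seg_a, vp)
error_inconsistent = angular_error_deg(seg_c, vp)
```

A small angular error means the segment points toward the shared vanishing point. A large error on an edge that should be parallel to the others in 3D is the kind of inconsistency the abstract describes.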

Country of Origin
🇯🇵 Japan

Page Count
14 pages

Category
Computer Science:
CV and Pattern Recognition