PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design
By: Jiazhe Wei , Ken Li , Tianyu Lao and more
Potential Business Impact:
Creates amazing posters with easy editing.
Graphic design forms the cornerstone of modern visual communication, serving as a vital medium for promoting cultural and commercial events. Recent advances have explored automating this process using Large Multimodal Models (LMMs), yet existing methods often produce geometrically inaccurate layouts and lack the iterative, layer-specific editing required in professional workflows. To address these limitations, we present PosterCopilot, a framework that advances layout reasoning and controllable editing for professional graphic design. Specifically, we introduce a progressive three-stage training strategy that equips LMMs with geometric understanding and aesthetic reasoning for layout design, consisting of Perturbed Supervised Fine-Tuning, Reinforcement Learning for Visual-Reality Alignment, and Reinforcement Learning from Aesthetic Feedback. Furthermore, we develop a complete workflow that couples the trained LMM-based design model with generative models, enabling layer-controllable, iterative editing for precise element refinement while maintaining global visual consistency. Extensive experiments demonstrate that PosterCopilot achieves geometrically accurate and aesthetically superior layouts, offering unprecedented controllability for professional iterative design.
Similar Papers
CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation
CV and Pattern Recognition
Makes posters from your words and pictures.
PosterGen: Aesthetic-Aware Paper-to-Poster Generation via Multi-Agent LLMs
Artificial Intelligence
Makes research posters automatically look good.
SciPostGen: Bridging the Gap between Scientific Papers and Poster Layouts
CV and Pattern Recognition
Creates better science posters from research papers.