Score: 1

PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents

Published: May 29, 2025 | arXiv ID: 2505.23130v1

By: Haoyu Chen , Keda Tao , Yizao Wang and more

Potential Business Impact:

Makes computers edit photos like artists.

Business Areas:
Photo Editing Content and Publishing, Media and Entertainment

Photo retouching is integral to photographic art, extending far beyond simple technical fixes to heighten emotional expression and narrative depth. While artists leverage expertise to create unique visual effects through deliberate adjustments, non-professional users often rely on automated tools that produce visually pleasing results but lack interpretative depth and interactive transparency. In this paper, we introduce PhotoArtAgent, an intelligent system that combines Vision-Language Models (VLMs) with advanced natural language reasoning to emulate the creative process of a professional artist. The agent performs explicit artistic analysis, plans retouching strategies, and outputs precise parameters to Lightroom through an API. It then evaluates the resulting images and iteratively refines them until the desired artistic vision is achieved. Throughout this process, PhotoArtAgent provides transparent, text-based explanations of its creative rationale, fostering meaningful interaction and user control. Experimental results show that PhotoArtAgent not only surpasses existing automated tools in user studies but also achieves results comparable to those of professional human artists.

Country of Origin
🇨🇳 🇦🇺 China, Australia

Page Count
31 pages

Category
Computer Science:
CV and Pattern Recognition