PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents

Haoyu Chen; Keda Tao; Yizao Wang; Xinlei Wang; Lei Zhu; Jinjin Gu

arXiv:2505.23130 · cs.CV · May 30, 2025

TL;DR

PhotoArtAgent is an AI system that combines vision-language models and natural language reasoning to emulate professional artistic retouching, providing transparent, iterative, and user-controllable photo enhancement.

Contribution

The paper introduces an AI agent that plans, executes, and explains artistic photo retouching, using vision-language models for reasoning and the Lightroom API to apply the adjustments.

Findings

1. Outperforms existing automated tools in user studies

2. Achieves results comparable to professional artists

3. Provides transparent, explainable retouching process

Abstract

Photo retouching is integral to photographic art, extending far beyond simple technical fixes to heighten emotional expression and narrative depth. While artists leverage expertise to create unique visual effects through deliberate adjustments, non-professional users often rely on automated tools that produce visually pleasing results but lack interpretative depth and interactive transparency. In this paper, we introduce PhotoArtAgent, an intelligent system that combines Vision-Language Models (VLMs) with advanced natural language reasoning to emulate the creative process of a professional artist. The agent performs explicit artistic analysis, plans retouching strategies, and outputs precise parameters to Lightroom through an API. It then evaluates the resulting images and iteratively refines them until the desired artistic vision is achieved. Throughout this process, PhotoArtAgent…
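
The loop the abstract describes (analyze, plan, apply parameters, evaluate, refine) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: every name here (`retouch_loop`, `Round`, the `demo_*` stubs, the `exposure` parameter) is hypothetical, the "image" is reduced to a single brightness number, and the VLM and Lightroom calls are replaced by plain Python callables.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Params = Dict[str, float]  # hypothetical parameter set, e.g. {"exposure": 0.5}


@dataclass
class Round:
    """One iteration of the loop, kept so the process stays inspectable."""
    params: Params
    critique: str
    satisfied: bool


def retouch_loop(
    image: float,
    analyze: Callable[[float], str],
    plan: Callable[[str, Params, Optional[str]], Params],
    apply_params: Callable[[float, Params], float],
    evaluate: Callable[[float, str], Tuple[str, bool]],
    max_rounds: int = 5,
) -> Tuple[float, List[Round]]:
    """Analyze once, then plan, apply, evaluate, and refine until satisfied."""
    concept = analyze(image)            # stand-in for the artistic analysis
    params = plan(concept, {}, None)    # initial retouching plan
    history: List[Round] = []
    result = image
    for _ in range(max_rounds):
        # Always re-render from the original image with the latest parameters.
        result = apply_params(image, params)
        critique, satisfied = evaluate(result, concept)
        history.append(Round(dict(params), critique, satisfied))
        if satisfied:
            break
        params = plan(concept, params, critique)  # refine using the critique
    return result, history


# Toy stand-ins (assumptions): a real system would call a VLM and an editor API.
def demo_analyze(image: float) -> str:
    return "bright and airy"


def demo_plan(concept: str, previous: Params, critique: Optional[str]) -> Params:
    # Raise exposure by a fixed step each round.
    return {"exposure": previous.get("exposure", 0.0) + 0.5}


def demo_apply(image: float, params: Params) -> float:
    return image + params["exposure"]


def demo_evaluate(result: float, concept: str) -> Tuple[str, bool]:
    ok = result >= 1.0
    return ("matches the concept" if ok else "still too dark", ok)


if __name__ == "__main__":
    final, rounds = retouch_loop(
        0.0, demo_analyze, demo_plan, demo_apply, demo_evaluate
    )
    for r in rounds:
        print(r.params, r.critique, r.satisfied)
```

Keeping a per-round history of parameters and critiques is one simple way to expose the kind of transparency the abstract mentions; the stopping rule and the `max_rounds` cap are illustrative choices only.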
