LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents
Bingchen Li, Xin Li, Yiting Lu, Zhibo Chen

TL;DR
LossAgent leverages large language models to understand and optimize complex, customized objectives in low-level image processing tasks, surpassing traditional loss functions.
Contribution
This work introduces LossAgent, the first LLM-based loss agent capable of handling diverse and complex optimization objectives in image processing tasks.
Findings
Effective in super-resolution and restoration tasks
Supports complex perceptual and human feedback objectives
Demonstrates adaptability across multiple image processing scenarios
Abstract
We present the first loss agent, dubbed LossAgent, for low-level image processing tasks, e.g., image super-resolution and restoration, intending to achieve any customized optimization objectives of low-level image processing in different practical applications. Notably, not all optimization objectives, such as complex hand-crafted perceptual metrics, text description, and intricate human feedback, can be instantiated with existing low-level losses, e.g., MSE loss, which presents a crucial challenge in optimizing image processing networks in an end-to-end manner. To eliminate this, our LossAgent introduces the powerful large language model (LLM) as the loss agent, where the rich textual understanding of prior knowledge empowers the loss agent with the potential to understand complex optimization objectives, trajectory, and state feedback from external environments in the optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Scientific Computing and Data Management
