MO-CAPO: Multi-Objective Cost-Aware Prompt Optimization
Jan B\"ussing, Moritz Schlager, Timo Hei{\ss}, Tom Zehle, and Matthias Feurer

TL;DR
MO-CAPO is a multi-objective prompt optimization algorithm for large language models that balances performance and inference cost efficiently, producing diverse solutions suitable for practical deployment.
Contribution
It introduces a novel multi-objective optimization method that jointly considers performance and cost, leveraging budget allocation and a deployment-oriented cost objective.
Findings
Outperforms NSGA-II baseline on 8 out of 12 cases in noisy R2 metric.
Achieves competitive performance at lower budgets.
Provides diverse Pareto front solutions balancing performance and cost.
Abstract
Large language models (LLMs) achieve strong performance across a wide range of tasks but are highly sensitive to prompt design, motivating the need for automatic prompt optimization. Existing methods predominantly focus on performance alone, ignoring competing objectives such as inference cost or latency. At the same time, existing work on multi-objective prompt optimization relies on off-the-shelf NSGA-II, ignoring optimization efficiency. As a remedy, we introduce MO-CAPO, a novel multi-objective prompt optimization algorithm that jointly optimizes performance and inference cost while leveraging budget allocation for cost-efficient optimization. We further propose a deployment-oriented cost objective that captures the full computational profile of LLM inference. We evaluate our approach across four tasks and three LLMs and compare it to an NSGA-II-based multi-objective method and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
