Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for   LLM and RAG Systems

Matthew Barker; Andrew Bell; Evan Thomas; James Carr; Thomas Andrews; and Umang Bhatt

arXiv:2502.18635·cs.LG·May 9, 2025

Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems

Matthew Barker, Andrew Bell, Evan Thomas, James Carr, Thomas Andrews, and Umang Bhatt

PDF

Open Access 1 Datasets

TL;DR

This paper introduces a multi-objective hyperparameter optimization method for RAG and LLM systems, improving cost, latency, safety, and alignment, and demonstrating superior performance over baselines on new benchmarks.

Contribution

It presents the first multi-objective Bayesian optimization approach for tuning entire RAG and LLM systems across multiple conflicting objectives.

Findings

01

Bayesian optimization outperforms baseline methods

02

Achieves better Pareto fronts on RAG benchmarks

03

Highlights task-specific and objective-specific configuration nuances

Abstract

While Retrieval Augmented Generation (RAG) has emerged as a popular technique for improving Large Language Model (LLM) systems, it introduces a large number of choices, parameters and hyperparameters that must be made or tuned. This includes the LLM, embedding, and ranker models themselves, as well as hyperparameters governing individual RAG components. Yet, collectively optimizing the entire configuration in a RAG or LLM system remains under-explored - especially in multi-objective settings - due to intractably large solution spaces, noisy objective evaluations, and the high cost of evaluations. In this work, we introduce the first approach for multi-objective parameter optimization of cost, latency, safety and alignment over entire LLM and RAG systems. We find that Bayesian optimization methods significantly outperform baseline approaches, obtaining a superior Pareto front on two new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Trustwise/optimization-benchmark-dataset
dataset· 8 dl
8 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Layer Normalization · Byte Pair Encoding · Attention Dropout · Softmax · Residual Connection · WordPiece