MAVIS: Multi-Objective Alignment via Inference-Time Value-Guided Selection
Jeremy Carleton, Debajoy Mukherjee, Srinivas Shakkottai, Dileep Kalathil

TL;DR
MAVIS is a lightweight, inference-time framework that enables dynamic multi-objective alignment of large language models by combining small value models, avoiding costly fine-tuning and allowing flexible trade-offs.
Contribution
MAVIS introduces a novel inference-time alignment method using small value models for multiple objectives, improving flexibility and efficiency over traditional fine-tuning approaches.
Findings
MAVIS outperforms baselines on Pareto frontiers for multi-objective alignment.
The method enables dynamic trade-offs without modifying base model weights.
Empirical results show monotonic improvement in policy with the iterative training algorithm.
Abstract
Large Language Models (LLMs) are increasingly deployed across diverse applications that demand balancing multiple, often conflicting, objectives -- such as helpfulness, harmlessness, or humor. Many traditional methods for aligning outputs to user-specific preferences require fine-tuning models for each objective or for specific preference configurations, which is computationally expensive and inflexible. We introduce \textbf{MAVIS} -- \textit{Multi-Objective Alignment via Inference-Time Value-Guided Selection} -- a lightweight inference-time alignment framework that enables dynamic control over LLM behavior without modifying the base model's weights. MAVIS trains a set of small value models, each corresponding to a distinct objective. At inference time, these value models are combined using user-specified weights to produce a tilting function that adjusts the base model's output…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Constraint Satisfaction and Optimization
