MAVIS: Multi-Objective Alignment via Inference-Time Value-Guided Selection

Jeremy Carleton; Debajoy Mukherjee; Srinivas Shakkottai; Dileep Kalathil

arXiv:2508.13415·cs.LG·February 17, 2026

MAVIS: Multi-Objective Alignment via Inference-Time Value-Guided Selection

Jeremy Carleton, Debajoy Mukherjee, Srinivas Shakkottai, Dileep Kalathil

PDF

Open Access

TL;DR

MAVIS is a lightweight, inference-time framework that enables dynamic multi-objective alignment of large language models by combining small value models, avoiding costly fine-tuning and allowing flexible trade-offs.

Contribution

MAVIS introduces a novel inference-time alignment method using small value models for multiple objectives, improving flexibility and efficiency over traditional fine-tuning approaches.

Findings

01

MAVIS outperforms baselines on Pareto frontiers for multi-objective alignment.

02

The method enables dynamic trade-offs without modifying base model weights.

03

Empirical results show monotonic improvement in policy with the iterative training algorithm.

Abstract

Large Language Models (LLMs) are increasingly deployed across diverse applications that demand balancing multiple, often conflicting, objectives -- such as helpfulness, harmlessness, or humor. Many traditional methods for aligning outputs to user-specific preferences require fine-tuning models for each objective or for specific preference configurations, which is computationally expensive and inflexible. We introduce \textbf{MAVIS} -- \textit{Multi-Objective Alignment via Inference-Time Value-Guided Selection} -- a lightweight inference-time alignment framework that enables dynamic control over LLM behavior without modifying the base model's weights. MAVIS trains a set of small value models, each corresponding to a distinct objective. At inference time, these value models are combined using user-specified weights to produce a tilting function that adjusts the base model's output…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Constraint Satisfaction and Optimization