Adaptive Combinatorial Experimental Design: Pareto Optimality for Decision-Making and Inference

Hongrui Xie; Junyu Cao; Kan Xu

arXiv:2602.24231·cs.LG·March 2, 2026

Adaptive Combinatorial Experimental Design: Pareto Optimality for Decision-Making and Inference

Hongrui Xie, Junyu Cao, Kan Xu

PDF

Open Access

TL;DR

This paper introduces a framework for adaptive combinatorial experimental design that balances regret minimization and statistical inference, using Pareto optimality to guide decision-making in multi-armed bandit problems.

Contribution

It formalizes the Pareto optimality concept for CMAB, proposes two algorithms with theoretical guarantees, and analyzes the impact of feedback richness on the Pareto frontier.

Findings

01

Both algorithms are Pareto optimal with finite-time guarantees.

02

Richer feedback improves the Pareto frontier and estimation accuracy.

03

The framework advances adaptive experimentation in multi-objective decision-making.

Abstract

In this paper, we provide the first investigation into adaptive combinatorial experimental design, focusing on the trade-off between regret minimization and statistical power in combinatorial multi-armed bandits (CMAB). While minimizing regret requires repeated exploitation of high-reward arms, accurate inference on reward gaps requires sufficient exploration of suboptimal actions. We formalize this trade-off through the concept of Pareto optimality and establish equivalent conditions for Pareto-efficient learning in CMAB. We consider two relevant cases under different information structures, i.e., full-bandit feedback and semi-bandit feedback, and propose two algorithms MixCombKL and MixCombUCB respectively for these two cases. We provide theoretical guarantees showing that both algorithms are Pareto optimal, achieving finite-time guarantees on both regret and estimation error of arm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Multi-Objective Optimization Algorithms · Stochastic Gradient Optimization Techniques