A Multi-Armed Bandit Approach to Online Selection and Evaluation of   Generative Models

Xiaoyan Hu; Ho-fung Leung; and Farzan Farnia

arXiv:2406.07451·cs.LG·March 12, 2025

A Multi-Armed Bandit Approach to Online Selection and Evaluation of Generative Models

Xiaoyan Hu, Ho-fung Leung, and Farzan Farnia

PDF

Open Access

TL;DR

This paper introduces an online, sample-efficient framework for evaluating and selecting the best generative models using multi-armed bandit algorithms, specifically UCB variants, to optimize evaluation metrics like Fréchet Distance and Inception Score.

Contribution

It proposes a novel online evaluation method for generative models using MAB algorithms, with theoretical regret bounds and empirical validation on image datasets.

Findings

01

MAB algorithms effectively identify high-quality generative models with fewer samples.

02

The proposed FD-UCB and IS-UCB algorithms outperform baseline methods in sample efficiency.

03

Empirical results demonstrate the approach's potential for practical model selection in generative modeling.

Abstract

Existing frameworks for evaluating and comparing generative models consider an offline setting, where the evaluator has access to large batches of data produced by the models. However, in practical scenarios, the goal is often to identify and select the best model using the fewest possible generated samples to minimize the costs of querying data from the sub-optimal models. In this work, we propose an online evaluation and selection framework to find the generative model that maximizes a standard assessment score among a group of available models. We view the task as a multi-armed bandit (MAB) and propose upper confidence bound (UCB) bandit algorithms to identify the model producing data with the best evaluation score that quantifies the quality and diversity of generated data. Specifically, we develop the MAB-based selection of generative models considering the Fr\'echet Distance (FD)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications