Fast online inference for nonlinear contextual bandit based on   Generative Adversarial Network

Yun Da Tsai; Shou De Lin

arXiv:2202.08867·cs.LG·February 21, 2022

Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network

Yun Da Tsai, Shou De Lin

PDF

Open Access

TL;DR

This paper introduces a neural bandit model utilizing generative adversarial networks to achieve fast, approximate inference in nonlinear contextual bandits with large arm sets, significantly reducing inference time.

Contribution

It proposes a novel generative adversarial network approach to shift the computational bottleneck from inference to training, enabling efficient batch and parallel processing for large-scale bandit problems.

Findings

01

Achieves $O(\log n)$ inference time complexity.

02

Demonstrates order-of-magnitude speedup in experiments.

03

Maintains comparable performance to existing methods.

Abstract

This work addresses the efficiency concern on inferring a nonlinear contextual bandit when the number of arms $n$ is very large. We propose a neural bandit model with an end-to-end training process to efficiently perform bandit algorithms such as Thompson Sampling and UCB during inference. We advance state-of-the-art time complexity to $O (lo g n)$ with approximate Bayesian inference, neural random feature mapping, approximate global maxima and approximate nearest neighbor search. We further propose a generative adversarial network to shift the bottleneck of maximizing the objective for selecting optimal arms from inference time to training time, enjoying significant speedup with additional advantage of enabling batch and parallel processing. %The generative model can inference an approximate argmax of the posterior sampling in logarithmic time complexity with the help of approximate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning