Empirical analysis of representation learning and exploration in neural   kernel bandits

Michal Lisicki; Arash Afkanpour; Graham W. Taylor

arXiv:2111.03543·cs.LG·October 11, 2022

Empirical analysis of representation learning and exploration in neural kernel bandits

Michal Lisicki, Arash Afkanpour, Graham W. Taylor

PDF

Open Access 2 Repos

TL;DR

This paper investigates the use of neural kernels in bandit algorithms, demonstrating their effectiveness in nonlinear decision tasks and providing a framework to analyze their representation learning and exploration capabilities.

Contribution

It introduces NK-based bandits that outperform existing methods, and proposes a framework to evaluate their representation learning and exploration abilities.

Findings

01

NK bandits achieve state-of-the-art performance on nonlinear data

02

The framework separates representation learning from exploration

03

Training frequency and model partitioning affect performance

Abstract

Neural bandits have been shown to provide an efficient solution to practical sequential decision tasks that have nonlinear reward functions. The main contributor to that success is approximate Bayesian inference, which enables neural network (NN) training with uncertainty estimates. However, Bayesian NNs often suffer from a prohibitive computational overhead or operate on a subset of parameters. Alternatively, certain classes of infinite neural networks were shown to directly correspond to Gaussian processes (GP) with neural kernels (NK). NK-GPs provide accurate uncertainty estimates and can be trained faster than most Bayesian NNs. We propose to guide common bandit policies with NK distributions and show that NK bandits achieve state-of-the-art performance on nonlinear structured data. Moreover, we propose a framework for measuring independently the ability of a bandit algorithm to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research · Machine Learning and Data Classification

MethodsGreedy Policy Search · Gaussian Process