Multi-armed quantum bandits: Exploration versus exploitation when   learning properties of quantum states

Josep Lumbreras; Erkka Haapasalo; Marco Tomamichel

arXiv:2108.13050·quant-ph·June 29, 2022·Quantum

Multi-armed quantum bandits: Exploration versus exploitation when learning properties of quantum states

Josep Lumbreras, Erkka Haapasalo, Marco Tomamichel

PDF

TL;DR

This paper explores the balance between exploration and exploitation in learning properties of unknown quantum states, providing theoretical bounds on regret and strategies for optimal learning in quantum bandit settings.

Contribution

It introduces the first information-theoretic lower bounds on regret in quantum bandits and proposes optimal strategies for finite actions and mixed states.

Findings

01

Regret scales at least as the square root of the number of rounds

02

Lower bounds depend on number of actions and space dimension

03

Strategies are optimal for finite arms and mixed states

Abstract

We initiate the study of tradeoffs between exploration and exploitation in online learning of properties of quantum states. Given sequential oracle access to an unknown quantum state, in each round, we are tasked to choose an observable from a set of actions aiming to maximize its expectation value on the state (the reward). Information gained about the unknown state from previous rounds can be used to gradually improve the choice of action, thus reducing the gap between the reward and the maximal reward attainable with the given action set (the regret). We provide various information-theoretic lower bounds on the cumulative regret that an optimal learner must incur, and show that it scales at least as the square root of the number of rounds played. We also investigate the dependence of the cumulative regret on the number of available actions and the dimension of the underlying space.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.