Quantum bandit with amplitude amplification exploration in an adversarial environment
Byungjin Cho, Yu Xiao, Pan Hui, and Daoyi Dong

TL;DR
This paper introduces a quantum-inspired bandit algorithm utilizing amplitude amplification for improved exploration in adversarial environments, demonstrating enhanced learning efficiency through simulation.
Contribution
It presents a novel quantum-inspired approach combining amplitude amplification and probabilistic action selection for adversarial bandit problems.
Findings
Effective in adversarial settings
Improved exploration-exploitation balance
Verified via simulation results
Abstract
The rapid proliferation of learning systems in an arbitrarily changing environment mandates the need for managing tensions between exploration and exploitation. This work proposes a quantum-inspired bandit learning approach for the learning-and-adapting-based offloading problem where a client observes and learns the costs of each task offloaded to the candidate resource providers, e.g., fog nodes. In this approach, a new action update strategy and novel probabilistic action selection are adopted, provoked by the amplitude amplification and collapse postulate in quantum computation theory, respectively. We devise a locally linear mapping between a quantum-mechanical phase in a quantum domain, e.g., Grover-type search algorithm, and a distilled probability-magnitude in a value-based decision-making domain, e.g., adversarial multi-armed bandit algorithm. The proposed algorithm is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Blockchain Technology Applications and Security · Advanced Bandit Algorithms Research
