Using Quantum Solved Deep Boltzmann Machines to Increase the Data   Efficiency of RL Agents

Daniel Kent; Clement O'Rourke; Jake Southall; Kirsty Duncan; Adrian; Bedford

arXiv:2408.17240·quant-ph·September 2, 2024

Using Quantum Solved Deep Boltzmann Machines to Increase the Data Efficiency of RL Agents

Daniel Kent, Clement O'Rourke, Jake Southall, Kirsty Duncan, Adrian, Bedford

PDF

Open Access

TL;DR

This paper demonstrates that integrating Quantum Solved Deep Boltzmann Machines with Proximal Policy Optimization significantly enhances data efficiency in reinforcement learning for cyber defense applications.

Contribution

It extends Deep Boltzmann Machines to reinforcement learning with quantum solutions, achieving notable data efficiency improvements in cyber defense scenarios.

Findings

01

Two-fold increase in data efficiency using quantum annealing

02

Successful integration of Deep Boltzmann Machines with PPO

03

Potential for wider adoption in data-efficient RL applications

Abstract

Deep Learning algorithms, such as those used in Reinforcement Learning, often require large quantities of data to train effectively. In most cases, the availability of data is not a significant issue. However, for some contexts, such as in autonomous cyber defence, we require data efficient methods. Recently, Quantum Machine Learning and Boltzmann Machines have been proposed as solutions to this challenge. In this work we build upon the pre-existing work to extend the use of Deep Boltzmann Machines to the cutting edge algorithm Proximal Policy Optimisation in a Reinforcement Learning cyber defence environment. We show that this approach, when solved using a D-WAVE quantum annealer, can lead to a two-fold increase in data efficiency. We therefore expect it to be used by the machine learning and quantum communities who are hoping to capitalise on data-efficient Reinforcement Learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Lattice Boltzmann Simulation Studies