Hybrid-Quantum Neural Architecture Search for The Proximal Policy   Optimization Algorithm

Moustafa Zada

arXiv:2501.10673·quant-ph·January 22, 2025

Hybrid-Quantum Neural Architecture Search for The Proximal Policy Optimization Algorithm

Moustafa Zada

PDF

TL;DR

This paper introduces a hybrid-quantum neural architecture search method for the PPO reinforcement learning algorithm, aiming to identify effective hybrid models and interpret their design choices amidst quantum hardware limitations.

Contribution

It applies a Regularized Evolution algorithm to optimize hybrid classical-quantum architectures for PPO and provides insights into factors influencing model performance.

Findings

01

Classical models outperformed hybrid models on the leaderboard.

02

The best hybrid model ranked eleventh among all models.

03

Insights into design factors that affect hybrid architecture performance.

Abstract

Recent studies in quantum machine learning advocated the use of hybrid models to assist with the limitations of the currently existing Noisy Intermediate Scale Quantum (NISQ) devices, but what was missing from most of them was the explanations and interpretations of the choices that were made to pick those exact architectures and the differentiation between good and bad hybrid architectures, this research attempts to tackle that gap in the literature by using the Regularized Evolution algorithm to search for the optimal hybrid classical-quantum architecture for the Proximal Policy Optimization (PPO) algorithm, a well-known reinforcement learning algorithm, ultimately the classical models dominated the leaderboard with the best hybrid model coming in eleventh place among all unique models, while we also try to explain the factors that contributed to such results,and for some models to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.