Quantum-Inspired Episode Selection for Monte Carlo Reinforcement Learning via QUBO Optimization

Hadi Salloum; Ali Jnadi; Yaroslav Kholodov; Alexander Gasnikov

arXiv:2601.17570·cs.LG·January 27, 2026

Quantum-Inspired Episode Selection for Monte Carlo Reinforcement Learning via QUBO Optimization

Hadi Salloum, Ali Jnadi, Yaroslav Kholodov, Alexander Gasnikov

PDF

Open Access

TL;DR

This paper introduces MC+QUBO, a quantum-inspired episode selection method for Monte Carlo reinforcement learning that improves convergence and policy quality by optimizing episode subsets with QUBO solvers.

Contribution

It reformulates episode selection as a QUBO problem and demonstrates the effectiveness of quantum-inspired samplers in enhancing reinforcement learning performance.

Findings

01

MC+QUBO outperforms vanilla Monte Carlo in convergence speed.

02

QUBO-based selection improves final policy quality.

03

Quantum-inspired optimization enhances decision-making in RL.

Abstract

Monte Carlo (MC) reinforcement learning suffers from high sample complexity, especially in environments with sparse rewards, large state spaces, and correlated trajectories. We address these limitations by reformulating episode selection as a Quadratic Unconstrained Binary Optimization (QUBO) problem and solving it with quantum-inspired samplers. Our method, MC+QUBO, integrates a combinatorial filtering step into standard MC policy evaluation: from each batch of trajectories, we select a subset that maximizes cumulative reward while promoting state-space coverage. This selection is encoded as a QUBO, where linear terms favor high-reward episodes and quadratic terms penalize redundancy. We explore both Simulated Quantum Annealing (SQA) and Simulated Bifurcation (SB) as black-box solvers within this framework. Experiments in a finite-horizon GridWorld demonstrate that MC+QUBO outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Quantum Information and Cryptography