Quantum-Efficient Reinforcement Learning Solutions for Last-Mile On-Demand Delivery

Farzan Moosavi; Bilal Farooq

arXiv:2508.09183·quant-ph·January 29, 2026

TL;DR

This paper explores the use of quantum-enhanced reinforcement learning to efficiently solve large-scale last-mile delivery routing problems, demonstrating potential advantages over classical methods in complexity and scalability.

Contribution

It introduces a novel quantum-augmented RL framework with a problem-specific quantum encoding circuit for optimizing delivery routes under real-world constraints.

Findings

01. Quantum RL outperforms classical methods in solution scale

02. Proposed quantum encoding circuit improves training efficiency

03. Method effectively handles complex delivery constraints

Abstract

Quantum computation has emerged as a promising alternative for solving NP-hard combinatorial problems. In optimization, classical approaches become intractable for large-scale instances. Specifically, we investigate quantum computing to solve the large-scale Capacitated Pickup and Delivery Problem with Time Windows (CPDPTW). To this end, a Reinforcement Learning (RL) framework augmented with a Parametrized Quantum Circuit (PQC) is designed to minimize the travel time in a realistic last-mile on-demand delivery setting. A novel problem-specific encoding quantum circuit with an entangling layer and a variational layer is proposed. Moreover, Proximal Policy Optimization (PPO) and Quantum Singular Value Transformation (QSVT) are designed for comparison through numerical experiments, highlighting the superiority of the proposed method in terms of the scale of…
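To make the encoding, entangling, and variational layers named in the abstract concrete, the following is a minimal sketch of a PQC used as an RL policy, simulated with plain NumPy. It is not the authors' circuit: the two-qubit size, the RY rotation gates, the single CNOT, the Pauli-Z readout, and the softmax over expectation values are all illustrative assumptions, and `pqc_policy` is a hypothetical name.

```python
import numpy as np

def ry(theta):
    """Single-qubit RY rotation matrix (real-valued)."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

# CNOT with qubit 0 as control and qubit 1 as target,
# in the basis ordering |00>, |01>, |10>, |11>.
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=float)
Z = np.diag([1.0, -1.0])
I2 = np.eye(2)

def pqc_policy(features, weights):
    """Map two state features to probabilities over two actions.

    Assumed layout (illustrative only): RY encoding layer,
    CNOT entangling layer, RY variational layer, Pauli-Z readout.
    """
    state = np.zeros(4)
    state[0] = 1.0                                              # start in |00>
    state = np.kron(ry(features[0]), ry(features[1])) @ state   # encoding layer
    state = CNOT @ state                                        # entangling layer
    state = np.kron(ry(weights[0]), ry(weights[1])) @ state     # variational layer
    z0 = state @ np.kron(Z, I2) @ state                         # <Z> on qubit 0
    z1 = state @ np.kron(I2, Z) @ state                         # <Z> on qubit 1
    logits = np.array([z0, z1])
    exp = np.exp(logits - logits.max())                         # stable softmax
    return exp / exp.sum()

probs = pqc_policy(np.array([0.0, np.pi]), np.array([0.0, 0.0]))
print(probs)
```

In an RL loop, `weights` would be the trainable parameters updated by a policy-gradient method such as PPO, while `features` would carry an encoding of the routing state. How the paper maps CPDPTW states onto rotation angles is not stated in the text above.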
