Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

Shihong Huang; Shengjie Wang; Lei Gao; Hong Ma; Zhanluo Zhang; Feng Zhang; Weihua Zhou

arXiv:2604.05195·cs.LG·April 8, 2026

Vehicle-as-Prompt: A Unified Deep Reinforcement Learning Framework for Heterogeneous Fleet Vehicle Routing Problem

Shihong Huang, Shengjie Wang, Lei Gao, Hong Ma, Zhanluo Zhang, Feng Zhang, Weihua Zhou

PDF

TL;DR

This paper introduces Vehicle-as-Prompt, a unified deep reinforcement learning framework that effectively solves complex heterogeneous fleet vehicle routing problems with diverse constraints, outperforming existing methods in solution quality and speed.

Contribution

The paper proposes the Vehicle-as-Prompt mechanism and VaP-CSMV framework, enabling a single model to handle various HFVRP variants with improved accuracy and efficiency.

Findings

01

VaP-CSMV outperforms existing DRL-based solvers.

02

Achieves solution quality comparable to traditional heuristics.

03

Reduces inference time to seconds and generalizes zero-shot.

Abstract

Unlike traditional homogeneous routing problems, the Heterogeneous Fleet Vehicle Routing Problem (HFVRP) involves heterogeneous fixed costs, variable travel costs, and capacity constraints, rendering solution quality highly sensitive to vehicle selection. Furthermore, real-world logistics applications often impose additional complex constraints, markedly increasing computational complexity. However, most existing Deep Reinforcement Learning (DRL)-based methods are restricted to homogeneous scenarios, leading to suboptimal performance when applied to HFVRP and its complex variants. To bridge this gap, we investigate HFVRP under complex constraints and develop a unified DRL framework capable of solving the problem across various variant settings. We introduce the Vehicle-as-Prompt (VaP) mechanism, which formulates the problem as a single-stage autoregressive decision process. Building on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.