Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II
Rixin Wu, Ran Wang, Jie Hao, Qiang Wu, Ping Wang, Dusit Niyato

TL;DR
This paper introduces a hybrid approach combining deep reinforcement learning and NSGA-II to efficiently solve multiobjective vehicle routing problems with time windows, improving solution quality and reducing training time.
Contribution
A novel weight-aware deep reinforcement learning framework with a transformer-based policy network for multiobjective vehicle routing, integrated with NSGA-II for enhanced optimization.
Findings
Outperforms traditional methods in solution quality.
Reduces training and solution generation time.
Improves scalability of multiobjective VRP solutions.
Abstract
This paper proposes a weight-aware deep reinforcement learning (WADRL) approach designed to address the multiobjective vehicle routing problem with time windows (MOVRPTW), aiming to use a single deep reinforcement learning (DRL) model to solve the entire multiobjective optimization problem. The Non-dominated sorting genetic algorithm-II (NSGA-II) method is then employed to optimize the outcomes produced by the WADRL, thereby mitigating the limitations of both approaches. Firstly, we design an MOVRPTW model to balance the minimization of travel cost and the maximization of customer satisfaction. Subsequently, we present a novel DRL framework that incorporates a transformer-based policy network. This network is composed of an encoder module, a weight embedding module where the weights of the objective functions are incorporated, and a decoder module. NSGA-II is then utilized to optimize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Vehicular Ad Hoc Networks (VANETs) · Modular Robots and Swarm Intelligence
MethodsEmirates Airlines Office in Dubai
