Multi-objective Pointer Network for Combinatorial Optimization
Le-yang Gao, Rui Wang, Chuang Liu, Zhao-hong Jia

TL;DR
This paper introduces MOPN, a deep reinforcement learning framework that efficiently solves multi-objective combinatorial optimization problems, outperforming classical methods in speed and scalability.
Contribution
The study proposes a novel single-model DRL framework, MOPN, with improved input structure and training strategies for multi-objective combinatorial optimization.
Findings
MOPN outperforms state-of-the-art models and classical meta-heuristics.
MOPN requires only 20-40% of the training time of DRL-MOA.
MOPN is insensitive to problem scale, handling different sizes effectively.
Abstract
Multi-objective combinatorial optimization problems (MOCOPs), one type of complex optimization problems, widely exist in various real applications. Although meta-heuristics have been successfully applied to address MOCOPs, the calculation time is often much longer. Recently, a number of deep reinforcement learning (DRL) methods have been proposed to generate approximate optimal solutions to the combinatorial optimization problems. However, the existing studies on DRL have seldom focused on MOCOPs. This study proposes a single-model deep reinforcement learning framework, called multi-objective Pointer Network (MOPN), where the input structure of PN is effectively improved so that the single PN is capable of solving MOCOPs. In addition, two training strategies, based on representative model and transfer learning, respectively, are proposed to further enhance the performance of MOPN in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Metaheuristic Optimization Algorithms Research
Methods[LivE@PeRson]How do I talk to a real person at Expedia? · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Softmax · Pointer Network
