A Two-stage Framework and Reinforcement Learning-based Optimization Algorithms for Complex Scheduling Problems
Yongming He, Guohua Wu, Yingwu Chen, Witold Pedrycz

TL;DR
This paper introduces a novel two-stage framework combining reinforcement learning and traditional operations research algorithms to efficiently solve complex scheduling problems, demonstrated on satellite scheduling with promising results.
Contribution
The paper presents a new general paradigm that integrates RL with OR approaches, improving efficiency and scalability in complex scheduling problem solutions.
Findings
The proposed algorithms effectively solve satellite scheduling problems.
RL-based methods show stronger scalability than non-learning algorithms.
The framework achieves stable and efficient scheduling in diverse scenarios.
Abstract
There hardly exists a general solver that is efficient for scheduling problems due to their diversity and complexity. In this study, we develop a two-stage framework, in which reinforcement learning (RL) and traditional operations research (OR) algorithms are combined together to efficiently deal with complex scheduling problems. The scheduling problem is solved in two stages, including a finite Markov decision process (MDP) and a mixed-integer programming process, respectively. This offers a novel and general paradigm that combines RL with OR approaches to solving scheduling problems, which leverages the respective strengths of RL and OR: The MDP narrows down the search space of the original problem through an RL method, while the mixed-integer programming process is settled by an OR algorithm. These two stages are performed iteratively and interactively until the termination criterion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSatellite Communication Systems · Resource-Constrained Project Scheduling · Advanced Wireless Network Optimization
