Accelerating Exact Combinatorial Optimization via RL-based Initialization -- A Case Study in Scheduling
Jiaqi Yin, Cunxi Yu

TL;DR
This paper introduces a two-phase RL-to-ILP scheduling framework that significantly accelerates exact combinatorial optimization for dataflow graph scheduling, achieving comparable performance with much faster runtimes on EdgeTPU platforms.
Contribution
The paper presents a novel RL-to-ILP framework that guarantees optimality and determinism while drastically reducing runtime in scheduling problems.
Findings
Achieves up to 128× speedup over traditional exact methods.
Maintains optimal scheduling performance comparable to exact methods.
Improves on-chip inference runtime and acceleration on EdgeTPU.
Abstract
Scheduling on dataflow graphs (also known as computation graphs) is an NP-hard problem. The traditional exact methods are limited by runtime complexity, while reinforcement learning (RL) and heuristic-based approaches struggle with determinism and solution quality. This research aims to develop an innovative approach that employs machine learning (ML) for addressing combinatorial optimization problems, using scheduling as a case study. The goal is to provide guarantees in optimality and determinism while maintaining the runtime cost of heuristic methods. Specifically, we introduce a novel two-phase RL-to-ILP scheduling framework, which includes three steps: 1) RL solver acts as coarse-grain scheduler, 2) solution relaxation and 3) exact solving via ILP. Our framework demonstrates the same scheduling performance compared with using exact scheduling methods while achieving up to 128…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · IoT and Edge/Fog Computing · Cloud Computing and Resource Management
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
