A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling
Ruisong Zhou, Haijun Zou, Li Zhou, Chumin Sun, Zaiwen Wen

TL;DR
This paper introduces WeCAN, a reinforcement learning framework for heterogeneous DAG scheduling that models task-pool interactions, analyzes optimality gaps, and improves scheduling efficiency and makespan in complex environments.
Contribution
We propose a novel end-to-end RL method with a gap-aware generation approach, including an order-space analysis and skip-extended realization, to enhance DAG scheduling performance.
Findings
Improved makespan over strong baselines.
Inference time comparable to classical heuristics.
Faster inference than multi-round neural schedulers.
Abstract
Efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments is challenging due to resource capacities and dependencies. In practice, the need for adaptability across environments with varying resource pools and task types, alongside rapid schedule generation, complicates these challenges. We propose WeCAN, an end-to-end reinforcement learning framework for heterogeneous DAG scheduling that addresses task--pool compatibility coefficients and generation-induced optimality gaps. It adopts a two-stage single-pass design: a single forward pass produces task--pool scores and global parameters, followed by a generation map that constructs schedules without repeated network calls. Its weighted cross-attention encoder models task--pool interactions gated by compatibility coefficients, and is size-agnostic to environment fluctuations. Moreover, widely used list-scheduling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Real-Time Systems Scheduling · Distributed and Parallel Computing Systems
