A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling

Ruisong Zhou; Haijun Zou; Li Zhou; Chumin Sun; Zaiwen Wen

arXiv:2603.23249·cs.LG·March 25, 2026

A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling

Ruisong Zhou, Haijun Zou, Li Zhou, Chumin Sun, Zaiwen Wen

PDF

Open Access

TL;DR

This paper introduces WeCAN, a reinforcement learning framework for heterogeneous DAG scheduling that models task-pool interactions, analyzes optimality gaps, and improves scheduling efficiency and makespan in complex environments.

Contribution

We propose a novel end-to-end RL method with a gap-aware generation approach, including an order-space analysis and skip-extended realization, to enhance DAG scheduling performance.

Findings

01

Improved makespan over strong baselines.

02

Inference time comparable to classical heuristics.

03

Faster inference than multi-round neural schedulers.

Abstract

Efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments is challenging due to resource capacities and dependencies. In practice, the need for adaptability across environments with varying resource pools and task types, alongside rapid schedule generation, complicates these challenges. We propose WeCAN, an end-to-end reinforcement learning framework for heterogeneous DAG scheduling that addresses task--pool compatibility coefficients and generation-induced optimality gaps. It adopts a two-stage single-pass design: a single forward pass produces task--pool scores and global parameters, followed by a generation map that constructs schedules without repeated network calls. Its weighted cross-attention encoder models task--pool interactions gated by compatibility coefficients, and is size-agnostic to environment fluctuations. Moreover, widely used list-scheduling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Real-Time Systems Scheduling · Distributed and Parallel Computing Systems