Task Placement and Resource Allocation for Edge Machine Learning: A GNN-based Multi-Agent Reinforcement Learning Paradigm

Yihong Li; Xiaoxi Zhang; Tianyu Zeng; Jingpu Duan; Chuan Wu; Di Wu; Xu Chen

arXiv:2302.00571 · cs.MA · September 9, 2025 · 1 citation

Open Access

TL;DR

TapFinger is a distributed scheduler for edge clusters. It uses multi-agent reinforcement learning (MARL) with a heterogeneous graph attention network to co-optimize task placement and fine-grained multi-resource allocation, reducing ML task completion time by up to 54.9%.

Contribution

The paper introduces a MARL-based scheduler that combines a heterogeneous graph attention network backbone, a tailored task selection phase in the actor network, and the integration of Bayes' theorem with masking schemes for multi-resource management in edge computing.

Findings

01. Achieves up to 54.9% reduction in task completion time.
02. Improves resource efficiency over existing schedulers.
03. Supports both single-task and multi-task scheduling modes.

Abstract

Machine learning (ML) tasks are one of the major workloads in today's edge computing networks. Existing edge-cloud schedulers allocate the requested amounts of resources to each task, falling short of best utilizing the limited edge resources for ML tasks. This paper proposes TapFinger, a distributed scheduler for edge clusters that minimizes the total completion time of ML tasks through co-optimizing task placement and fine-grained multi-resource allocation. To learn the tasks' uncertain resource sensitivity and enable distributed scheduling, we adopt multi-agent reinforcement learning (MARL) and propose several techniques to make it efficient, including a heterogeneous graph attention network as the MARL backbone, a tailored task selection phase in the actor network, and the integration of Bayes' theorem and masking schemes. We first implement a single-task scheduling version, which…
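
The abstract mentions masking schemes without describing them here. As a rough illustration only, the sketch below shows generic invalid-action masking in a softmax policy, a common way to keep an agent from choosing infeasible placements. It is not TapFinger's actual scheme; the function name `masked_softmax`, the three-node scenario, and the logit values are all hypothetical.

```python
import math

def masked_softmax(logits, mask):
    """Softmax over logits; actions with mask[i] == False get probability 0.

    Generic invalid-action masking, assumed here for illustration.
    It is not taken from the TapFinger paper.
    """
    if len(logits) != len(mask):
        raise ValueError("logits and mask must have the same length")
    if not any(mask):
        raise ValueError("at least one action must be valid")
    # Subtract the largest valid logit for numerical stability.
    peak = max(l for l, ok in zip(logits, mask) if ok)
    exps = [math.exp(l - peak) if ok else 0.0 for l, ok in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scenario: three candidate edge nodes, and the second one
# lacks the free resources the task needs, so it is masked out.
logits = [1.0, 3.0, 0.5]
mask = [True, False, True]
probs = masked_softmax(logits, mask)
```

Even though the masked node has the highest raw score, it receives zero probability, and the remaining probability mass is renormalized over the feasible nodes.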

Taxonomy

Topics: IoT and Edge/Fog Computing · Advanced Graph Neural Networks · Age of Information Optimization