Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks
Changfu Xu, Jianxiong Guo, Wanyu Lin, Haodong Zou, Wentao Fan, Tian, Wang, Xiaowen Chu, and Jiannong Cao

TL;DR
This paper introduces LAD-TS, a diffusion-based task scheduling method for edge networks that significantly reduces AIGC service delays by optimizing offloading decisions using reinforcement learning and diffusion models.
Contribution
The paper proposes a novel Latent Action Diffusion-based Task Scheduling approach for efficient AIGC service deployment at edge networks, addressing resource constraints and reducing delays.
Findings
Achieves up to 29.18% reduction in service delay.
Demonstrates effectiveness of LAD-TS in real edge system prototype.
Provides open-source implementation for reproducibility.
Abstract
Artificial Intelligence Generated Content (AIGC) has gained significant popularity for creating diverse content. Current AIGC models primarily focus on content quality within a centralized framework, resulting in a high service delay and negative user experiences. However, not only does the workload of an AIGC task depend on the AIGC model's complexity rather than the amount of data, but the large model and its multi-layer encoder structure also result in a huge demand for computational and memory resources. These unique characteristics pose new challenges in its modeling, deployment, and scheduling at edge networks. Thus, we model an offloading problem among edges for providing real AIGC services and propose LAD-TS, a novel Latent Action Diffusion-based Task Scheduling method that orchestrates multiple edge servers for expedited AIGC services. The LAD-TS generates a near-optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Distributed and Parallel Computing Systems · Medical Imaging Techniques and Applications
Methodstravel james · Diffusion · Focus
