Efficient Orchestrated AI Workflows Execution on Scale-out Spatial Architecture
Jinyi Deng, Xinru Tang, Zhiheng Yue, Guangyang Lu, Qize Yang, Jiahao, Zhang, Jinxi Li, Chao Li, Shaojun Wei, Yang Hu, Shouyi Yin

TL;DR
This paper introduces Octopus, a scale-out spatial architecture with advanced scheduling strategies designed to efficiently execute complex, dynamic AI workflows, outperforming traditional systems especially at large scales.
Contribution
The paper presents Octopus, a novel architecture with specialized scheduling strategies that address the dual dynamicity of orchestrated AI workflows, improving scalability and efficiency.
Findings
Octopus significantly outperforms traditional architectures in dynamic workload handling.
The architecture demonstrates robust scalability on wafer-scale hardware.
Advanced scheduling strategies effectively manage resource allocation and load balancing.
Abstract
Given the increasing complexity of AI applications, traditional spatial architectures frequently fall short. Our analysis identifies a pattern of interconnected, multi-faceted tasks encompassing both AI and general computational processes. In response, we have conceptualized "Orchestrated AI Workflows," an approach that integrates various tasks with logic-driven decisions into dynamic, sophisticated workflows. Specifically, we find that the intrinsic Dual Dynamicity of Orchestrated AI Workflows, namely dynamic execution times and frequencies of Task Blocks, can be effectively represented using the Orchestrated Workflow Graph. Furthermore, the intrinsic Dual Dynamicity poses challenges to existing spatial architecture, namely Indiscriminate Resource Allocation, Reactive Load Rebalancing, and Contagious PEA Idleness. To overcome these challenges, we present Octopus, a scale-out spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · 3D Modeling in Geospatial Applications · Modular Robots and Swarm Intelligence
