CIS: Composable Instruction Set for Data Streaming Applications
Yu Yang, Jordi Altay\'o Gonz\'alez, Paul Delestrac, Ahmed Hemani

TL;DR
This paper introduces a novel composable instruction set architecture (CIS) designed to enhance data streaming applications on hardware accelerators by improving instruction composability, extensibility, and performance.
Contribution
The paper presents a new CIS architecture that enables efficient, flexible, and extendable instruction sets tailored for data streaming workloads on accelerators.
Findings
CIS significantly outperforms existing ISAs in data streaming tasks.
CIS achieves near-optimal processing element utilization.
The resource-centric design facilitates easy extension with new hardware resources.
Abstract
The enhanced efficiency of hardware accelerators, including Single Instruction Multiple Data (SIMD) architectures and Coarse-Grained Reconfigurable Architectures (CGRAs), is driving significant advancements in Artificial Intelligence and Machine Learning (AI/ML) applications. These applications frequently involve data streaming operations comprised of numerous vector calculations inherently amenable to parallelization. However, despite considerable progress in hardware accelerator design, their potential remains constrained by conventional instruction set architectures (ISAs). Traditional ISAs, primarily designed for microprocessors and accelerators, emphasize computation while often neglecting instruction composability and inter-instruction cooperation. This limitation results in rigid ISAs that are difficult to extend and suffer from large control overhead in their hardware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Cloud Computing and Resource Management · Parallel Computing and Optimization Techniques
