Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication
Gordon E. Moon, Hyoukjun Kwon, Geonhwa Jeong, Prasanth Chatarasi,, Sivasankaran Rajamanickam, Tushar Krishna

TL;DR
This paper presents a framework to evaluate spatial accelerator architectures for machine learning by optimizing tiled GEMM mappings, achieving high performance across different hardware and workloads.
Contribution
It introduces an analytical framework for optimizing dataflow and tile sizes in tiled GEMM on spatial accelerators, enabling systematic evaluation.
Findings
Optimized GEMM mappings improve performance across accelerators.
Framework achieves high efficiency for various workloads.
Systematic evaluation aids in design choices for accelerators.
Abstract
There is a growing interest in custom spatial accelerators for machine learning applications. These accelerators employ a spatial array of processing elements (PEs) interacting via custom buffer hierarchies and networks-on-chip. The efficiency of these accelerators comes from employing optimized dataflow (i.e., spatial/temporal partitioning of data across the PEs and fine-grained scheduling) strategies to optimize data reuse. The focus of this work is to evaluate these accelerator architectures using a tiled general matrix-matrix multiplication (GEMM) kernel. To do so, we develop a framework that finds optimized mappings (dataflow and tile sizes) for a tiled GEMM for a given spatial accelerator and workload combination, leveraging an analytical cost model for runtime and energy. Our evaluations over five spatial accelerators demonstrate that the tiled GEMM mappings systematically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Stochastic Gradient Optimization Techniques · Advanced Data Storage Technologies
