GOMA: Geometrically Optimal Mapping via Analytical Modeling for Spatial Accelerators
Wulve Yang, Hailong Zou, Rui Zhou, Jionghao Zhang, Qiang Li, Gang Li, Yi Zhan, Shushan Qiao

TL;DR
GOMA introduces a geometric abstraction and analytical modeling approach to efficiently find globally optimal GEMM mappings on spatial accelerators, significantly improving energy-delay performance and solution time.
Contribution
GOMA is the first framework to guarantee globally optimal GEMM mappings using an analytical energy model and geometric abstraction, enabling fast and optimal solutions.
Findings
GOMA achieves 2.24-4.24x energy-delay product improvement over state-of-the-art mappers.
GOMA reduces mapping solution time by 3.83-73.6x.
It provides exact analytical evaluation with O(1) complexity for GEMM mappings.
Abstract
General matrix multiplication (GEMM) on spatial accelerators is highly sensitive to mapping choices in both execution efficiency and energy consumption. However, the mapping space exhibits combinatorial explosion, which makes it extremely challenging to obtain optimal mappings within an acceptable time budget. Existing approaches typically face challenges: They often lack global-optimality guarantees and become prohibitively slow as the mapping space grows. To address these limitations, we propose \textsc{GOMA}, a geometric-abstraction-based, globally optimal GEMM mapping framework via analytical modeling, which achieves efficient solving while guaranteeing optimality. \textsc{GOMA} introduces, from first principles, a geometric abstraction for GEMM mapping, yielding an exact analytical energy objective with evaluation for any given mapping. The objective is highly accurate.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Embedded Systems Design Techniques · Interconnection Networks and Systems
