TensorLib: A Spatial Accelerator Generation Framework for Tensor Algebra
Liancheng Jia, Zizhang Luo, Liqiang Lu, Yun Liang

TL;DR
TensorLib is a framework that automates the generation of spatial hardware accelerators for tensor algebra, enabling efficient design space exploration and achieving significant performance improvements on FPGA.
Contribution
It introduces a systematic framework using Space-Time Transformation and parameterized modules to automate hardware generation for tensor algebra accelerators.
Findings
Achieves 21% performance improvement on FPGA
Automates hardware design with dataflow exploration
Reduces development time for spatial accelerators
Abstract
Tensor algebra finds applications in various domains, and these applications, especially when accelerated on spatial hardware accelerators, can deliver high performance and low power. Spatial hardware accelerator exhibits complex design space. Prior approaches based on manual implementation lead to low programming productivity, rendering thorough design space exploration impossible. In this paper, we propose TensorLib, a framework for generating spatial hardware accelerator for tensor algebra applications. TensorLib is motivated by the observation that, different dataflows share common hardware modules, which can be reused across different designs. To build such a framework, TensorLib first uses Space-Time Transformation to explore different dataflows, which can compactly represent the hardware dataflow using a simple transformation matrix. Next, we identify the common structures of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Embedded Systems Design Techniques · Advanced Data Storage Technologies
