PipeOrgan: Efficient Inter-operation Pipelining with Flexible Spatial Organization and Interconnects
Raveesh Garg, Hyoukjun Kwon, Eric Qin, Yu-Hsin Chen, Tushar Krishna,, Liangzhen Lai

TL;DR
PipeOrgan introduces a flexible spatial data organization strategy for DNN accelerators, optimizing inter-operator pipelining by adapting to layer shapes and data volumes, resulting in nearly doubling performance.
Contribution
It proposes a novel spatial organization approach that supports variable pipelining depth and granularity, enhancing energy efficiency and performance in DNN accelerators.
Findings
Achieves 1.95x performance improvement over state-of-the-art methods.
Supports variable pipeline depths and spatial organizations.
Reduces memory accesses and communication overhead.
Abstract
Because of the recent trends in Deep Neural Networks (DNN) models being memory-bound, inter-operator pipelining for DNN accelerators is emerging as a promising optimization. Inter-operator pipelining reduces costly on-chip global memory and off-chip memory accesses by forwarding the output of a layer as the input of the next layer within the compute array, which is proven to be an effective optimization by previous works. However, the design space of inter-operator pipelining is huge, and the space is not yet fully explored. In particular, identifying the right depth and granularity of pipelining (or no pipelining at all) is significantly dependent on the layer shapes and data volumes of weights and activations, and these are different even within a domain. Moreover, works divide the substrate into large chunks and map one layer onto each chunk, which requires communicating halfway…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · DNA and Biological Computing · Electrowetting and Microfluidic Technologies
