Design and Scaffolded Training of an Efficient DNN Operator for Computer Vision on the Edge
Vinod Ganesan, Pratyush Kumar

TL;DR
This paper introduces FuSeConv, an efficient DNN operator optimized for systolic arrays, along with a training methodology that improves accuracy and performance for edge computer vision tasks.
Contribution
The paper proposes FuSeConv, a new convolution operator and dataflow, plus a training scaffolding method, to significantly enhance DNN efficiency and accuracy on systolic array hardware.
Findings
Achieves 4.1-9.25X speedup on ImageNet networks.
Outperforms depthwise separable convolutions on systolic arrays.
Maintains comparable accuracy to baseline models.
Abstract
Massively parallel systolic arrays and resource-efficient depthwise separable convolutions are two promising techniques to accelerate DNN inference on the edge. Interestingly, their combination is inefficient: Computational patterns of depthwise separable convolutions do not exhibit a rhythmic systolic flow and lack sufficient data reuse to saturate systolic arrays. We formally analyse this inefficiency and propose an efficient operator, an optimal hardware dataflow, and a superior training methodology towards alleviating this. The efficient operator, called FuSeConv, is a drop-in replacement for depthwise separable convolutions. FuSeConv factorizes convolution fully along their spatial and depth dimensions. The resultant computation efficiently maps to systolic arrays. The optimal dataflow, called Spatial-Tiled Output Stationary (ST-OS), maximizes the efficiency of FuSeConv on systolic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
MethodsConvolution
