Joint Program and Layout Transformations to enable Convolutional Operators on Specialized Hardware based on Constraint Programming
Dennis Rieber, Axel Acosta, Holger Fr\"oning

TL;DR
This paper introduces a bottom-up constraint programming approach for joint program and data layout transformations, enabling efficient convolutional operator implementation on specialized hardware, significantly improving performance and hardware utilization.
Contribution
It proposes a novel bottom-up method using constraint satisfaction for joint transformations, contrasting with prior top-down approaches, and demonstrates its effectiveness on hardware accelerators.
Findings
Achieves up to 2.813x speedup over reference implementations.
Automatically generates competitive code for specialized hardware.
Improves hardware utilization and performance through dynamic data layout determination.
Abstract
The success of Deep Artificial Neural Networks (DNNs) in many domains created a rich body of research concerned with hardware accelerators for compute-intensive DNN operators. However, implementing such operators efficiently with complex hardware intrinsics such as matrix multiply is a task not yet automated gracefully. Solving this task often requires joint program and data layout transformations. First solutions to this problem have been proposed, such as TVM, UNIT or ISAMIR, which work on a loop-level representation of operators and specify data layout and possible program transformations before the embedding into the operator is performed. This top-down approach creates a tension between exploration range and search space complexity, especially when also exploring data layout transformations such as im2col, channel packing or padding. In this work, we propose a new approach to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning in Materials Science · Industrial Vision Systems and Defect Detection
