Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training
Dingqing Yang, Amin Ghasemazar, Xiaowei Ren, Maximilian Golub, Guy, Lemieux, Mieszko Lis

TL;DR
Procrustes is a specialized hardware accelerator for sparse deep neural network training that co-designs algorithms and dataflows to efficiently produce pruned models with high accuracy, significantly reducing energy and time compared to traditional methods.
Contribution
The paper introduces a co-designed system combining sparse training algorithms with tailored hardware dataflows to efficiently accelerate sparse DNN training.
Findings
Up to 3.26× energy savings compared to dense training accelerators.
Up to 4× training speedup for sparse models.
Maintains accuracy while pruning weights by an order of magnitude.
Abstract
The success of DNN pruning has led to the development of energy-efficient inference accelerators that support pruned models with sparse weight and activation tensors. Because the memory layouts and dataflows in these architectures are optimized for the access patterns during , however, they do not efficiently support the emerging sparse techniques. In this paper, we demonstrate (a) that accelerating sparse training requires a co-design approach where algorithms are adapted to suit the constraints of hardware, and (b) that hardware for sparse DNN training must tackle constraints that do not arise in inference accelerators. As proof of concept, we adapt a sparse training algorithm to be amenable to hardware acceleration; we then develop dataflow, data layout, and load-balancing techniques to accelerate it. The resulting system is a sparse DNN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Parallel Computing and Optimization Techniques
MethodsPruning · Procrustes
