SeqPoint: Identifying Representative Iterations of Sequence-based Neural   Networks

Suchita Pati; Shaizeen Aga; Matthew D. Sinclair; and Nuwan Jayasena

arXiv:2007.10459·cs.DC·July 22, 2020

SeqPoint: Identifying Representative Iterations of Sequence-based Neural Networks

Suchita Pati, Shaizeen Aga, Matthew D. Sinclair, and Nuwan Jayasena

PDF

TL;DR

SeqPoint introduces a method to accurately identify representative iterations in sequence-based neural network training, enabling rapid profiling and analysis without extensive simulation, significantly reducing profiling time.

Contribution

The paper presents SeqPoint, a novel approach to select representative training iterations for SQNNs based on input sequence length, improving profiling efficiency and accuracy.

Findings

01

SeqPoint achieves less than 1.5% error in representing training behavior.

02

It reduces profiling time by over 200 times compared to full training.

03

Accurately projects runtime and speedups with minimal profiling data.

Abstract

The ubiquity of deep neural networks (DNNs) continues to rise, making them a crucial application class for hardware optimizations. However, detailed profiling and characterization of DNN training remains difficult as these applications often run for hours to days on real hardware. Prior works exploit the iterative nature of DNNs to profile a few training iterations. While such a strategy is sound for networks like convolutional neural networks (CNNs), where the nature of the computation is largely input independent, we observe in this work that this approach is sub-optimal for sequence-based neural networks (SQNNs) such as recurrent neural networks (RNNs). The amount and nature of computations in SQNNs can vary for each input, resulting in heterogeneity across iterations. Thus, arbitrarily selecting a few iterations is insufficient to accurately summarize the behavior of the entire…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.