MERCURY: Accelerating DNN Training By Exploiting Input Similarity

Vahid Janfaza; Kevin Weston; Moein Razavi; Shantanu Mandal; Farabi; Mahmud; Alex Hilty; Abdullah Muzahid

arXiv:2110.14904·cs.AR·November 4, 2022

MERCURY: Accelerating DNN Training By Exploiting Input Similarity

Vahid Janfaza, Kevin Weston, Moein Razavi, Shantanu Mandal, Farabi, Mahmud, Alex Hilty, Abdullah Muzahid

PDF

Open Access

TL;DR

MERCURY is a hardware accelerator scheme that speeds up DNN training by exploiting input vector similarities through random projection and caching, achieving nearly double the training speed with maintained accuracy.

Contribution

MERCURY introduces a novel input similarity exploitation method using RPQ and caching, the first of its kind for accelerating DNN training in hardware.

Findings

01

Average 1.97x speedup in training time

02

Significant reduction in computations

03

Maintains baseline accuracy

Abstract

Deep Neural Networks (DNN) are computationally intensive to train. It consists of a large number of multidimensional dot products between many weights and input vectors. However, there can be significant similarity among input vectors. If one input vector is similar to another, its computations with the weights are similar to those of the other and, therefore, can be skipped by reusing the already-computed results. We propose a novel scheme, called MERCURY, to exploit input similarity during DNN training in a hardware accelerator. MERCURY uses Random Projection with Quantization (RPQ) to convert an input vector to a bit sequence, called Signature. A cache (MCACHE) stores signatures of recent input vectors along with the computed results. If the Signature of a new input vector matches that of an already existing vector in the MCACHE, the two vectors are found to have similarities.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques