MERCURY: Accelerating DNN Training By Exploiting Input Similarity
Vahid Janfaza, Kevin Weston, Moein Razavi, Shantanu Mandal, Farabi, Mahmud, Alex Hilty, Abdullah Muzahid

TL;DR
MERCURY is a hardware accelerator scheme that speeds up DNN training by exploiting input vector similarities through random projection and caching, achieving nearly double the training speed with maintained accuracy.
Contribution
MERCURY introduces a novel input similarity exploitation method using RPQ and caching, the first of its kind for accelerating DNN training in hardware.
Findings
Average 1.97x speedup in training time
Significant reduction in computations
Maintains baseline accuracy
Abstract
Deep Neural Networks (DNN) are computationally intensive to train. It consists of a large number of multidimensional dot products between many weights and input vectors. However, there can be significant similarity among input vectors. If one input vector is similar to another, its computations with the weights are similar to those of the other and, therefore, can be skipped by reusing the already-computed results. We propose a novel scheme, called MERCURY, to exploit input similarity during DNN training in a hardware accelerator. MERCURY uses Random Projection with Quantization (RPQ) to convert an input vector to a bit sequence, called Signature. A cache (MCACHE) stores signatures of recent input vectors along with the computed results. If the Signature of a new input vector matches that of an already existing vector in the MCACHE, the two vectors are found to have similarities.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
