Speeding-Up Back-Propagation in DNN: Approximate Outer Product with   Memory

Eduin E. Hernandez; Stefano Rini; Tolga M. Duman

arXiv:2110.09164·cs.LG·October 19, 2021

Speeding-Up Back-Propagation in DNN: Approximate Outer Product with Memory

Eduin E. Hernandez, Stefano Rini, Tolga M. Duman

PDF

Open Access 1 Repo

TL;DR

This paper introduces Mem-AOP-GD, an approximation method for back-propagation in DNN training that reduces computation by selectively using outer products and maintaining memory of omitted terms, improving efficiency and accuracy.

Contribution

The paper proposes Mem-AOP-GD, a novel algorithm that approximates backpropagation using subset outer products with memory correction, enhancing training speed and accuracy.

Findings

01

Significant reduction in computational complexity.

02

Improved training accuracy with approximation.

03

Effective outer product selection policies.

Abstract

In this paper, an algorithm for approximate evaluation of back-propagation in DNN training is considered, which we term Approximate Outer Product Gradient Descent with Memory (Mem-AOP-GD). The Mem-AOP-GD algorithm implements an approximation of the stochastic gradient descent by considering only a subset of the outer products involved in the matrix multiplications that encompass backpropagation. In order to correct for the inherent bias in this approximation, the algorithm retains in memory an accumulation of the outer products that are not used in the approximation. We investigate the performance of the proposed algorithm in terms of DNN training loss under two design parameters: (i) the number of outer products used for the approximation, and (ii) the policy used to select such outer products. We experimentally show that significant improvements in computational complexity as well as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hernandezeduin/mem-aop-gd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Advanced Neural Network Applications