Impact of GPU uncertainty on the training of predictive deep neural   networks

Maciej Pietrowski; Andrzej Gajda; Takuto Yamamoto; Taisuke Kobayashi,; Lana Sinapayen; Eiji Watanabe

arXiv:2109.01451·cs.LG·October 7, 2021·1 cites

Impact of GPU uncertainty on the training of predictive deep neural networks

Maciej Pietrowski, Andrzej Gajda, Takuto Yamamoto, Taisuke Kobayashi,, Lana Sinapayen, Eiji Watanabe

PDF

Open Access

TL;DR

This paper investigates how GPU-induced uncertainties affect deep neural network training, revealing that such uncertainties can enhance learning accuracy and may be beneficial rather than solely problematic.

Contribution

It demonstrates that GPU-specific uncertainties can improve neural network training outcomes, challenging the view that hardware noise is purely detrimental.

Findings

01

GPU uncertainty increased learning accuracy in certain neural networks

02

Training on CPU alone resulted in higher error than GPU training

03

GPU-specific indeterminacy may be beneficial for neural network learning

Abstract

[retracted] We found out that the difference was dependent on the Chainer library, and does not replicate with another library (pytorch) which indicates that the results are probably due to a bug in Chainer, rather than being hardware-dependent. -- old abstract Deep neural networks often present uncertainties such as hardware- and software-derived noise and randomness. We studied the effects of such uncertainty on learning outcomes, with a particular focus on the function of graphics processing units (GPUs), and found that GPU-induced uncertainty increased learning accuracy of a certain deep neural network. When training a predictive deep neural network using only the CPU without the GPU, the learning error is higher than when training the same number of epochs using the GPU, suggesting that the GPU plays a different role in the learning process than just increasing the computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Neural dynamics and brain function · Adversarial Robustness in Machine Learning