Impact of GPU uncertainty on the training of predictive deep neural networks
Maciej Pietrowski, Andrzej Gajda, Takuto Yamamoto, Taisuke Kobayashi,, Lana Sinapayen, Eiji Watanabe

TL;DR
This paper investigates how GPU-induced uncertainties affect deep neural network training, revealing that such uncertainties can enhance learning accuracy and may be beneficial rather than solely problematic.
Contribution
It demonstrates that GPU-specific uncertainties can improve neural network training outcomes, challenging the view that hardware noise is purely detrimental.
Findings
GPU uncertainty increased learning accuracy in certain neural networks
Training on CPU alone resulted in higher error than GPU training
GPU-specific indeterminacy may be beneficial for neural network learning
Abstract
[retracted] We found out that the difference was dependent on the Chainer library, and does not replicate with another library (pytorch) which indicates that the results are probably due to a bug in Chainer, rather than being hardware-dependent. -- old abstract Deep neural networks often present uncertainties such as hardware- and software-derived noise and randomness. We studied the effects of such uncertainty on learning outcomes, with a particular focus on the function of graphics processing units (GPUs), and found that GPU-induced uncertainty increased learning accuracy of a certain deep neural network. When training a predictive deep neural network using only the CPU without the GPU, the learning error is higher than when training the same number of epochs using the GPU, suggesting that the GPU plays a different role in the learning process than just increasing the computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Neural dynamics and brain function · Adversarial Robustness in Machine Learning
