Minibatch training of neural network ensembles via trajectory sampling

Jamie F. Mair; Luke Causer; Juan P. Garrahan

arXiv:2306.13442·cond-mat.stat-mech·June 28, 2023·1 cites

Minibatch training of neural network ensembles via trajectory sampling

Jamie F. Mair, Luke Causer, Juan P. Garrahan

PDF

Open Access

TL;DR

This paper introduces a minibatch trajectory sampling method for efficiently training neural network ensembles, significantly reducing training time and improving inference accuracy on image classification tasks.

Contribution

It presents a novel approach to train neural network ensembles using trajectory sampling with minibatches, achieving substantial computational efficiency gains.

Findings

01

Training time reduced by two orders of magnitude on MNIST.

02

Longer trajectories improve inference accuracy.

03

Method scales with dataset and minibatch size ratio.

Abstract

Most iterative neural network training methods use estimates of the loss function over small random subsets (or minibatches) of the data to update the parameters, which aid in decoupling the training time from the (often very large) size of the training datasets. Here, we show that a minibatch approach can also be used to train neural network ensembles (NNEs) via trajectory methods in a highly efficient manner. We illustrate this approach by training NNEs to classify images in the MNIST datasets. This method gives an improvement to the training times, allowing it to scale as the ratio of the size of the dataset to that of the average minibatch size which, in the case of MNIST, gives a computational improvement typically of two orders of magnitude. We highlight the advantage of using longer trajectories to represent NNEs, both for improved accuracy in inference and reduced update cost in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Neural Networks and Applications