Training neural network ensembles via trajectory sampling

Jamie F. Mair; Dominic C. Rose; Juan P. Garrahan

arXiv:2209.11116·cond-mat.stat-mech·May 11, 2023

Training neural network ensembles via trajectory sampling

Jamie F. Mair, Dominic C. Rose, Juan P. Garrahan

PDF

Open Access

TL;DR

This paper introduces a novel method for training neural network ensembles by sampling parameter trajectories biased towards low loss, using techniques from stochastic systems, offering an alternative to gradient-based training.

Contribution

It proposes a new trajectory sampling approach for training neural network ensembles, leveraging stochastic dynamics and biasing techniques, which differs from traditional gradient methods.

Findings

01

Effective on simple supervised tasks

02

Potential advantages over gradient-based methods

03

Demonstrates viability of trajectory sampling approach

Abstract

In machine learning, there is renewed interest in neural network ensembles (NNEs), whereby predictions are obtained as an aggregate from a diverse set of smaller models, rather than from a single larger model. Here, we show how to define and train a NNE using techniques from the study of rare trajectories in stochastic systems. We define an NNE in terms of the trajectory of the model parameters under a simple, and discrete in time, diffusive dynamics, and train the NNE by biasing these trajectories towards a small time-integrated loss, as controlled by appropriate counting fields which act as hyperparameters. We demonstrate the viability of this technique on a range of simple supervised learning tasks. We discuss potential advantages of our trajectory sampling approach compared with more conventional gradient based methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Time Series Analysis and Forecasting