Regularizing Neural Networks by Stochastically Training Layer Ensembles

Alex Labach; Shahrokh Valaee

arXiv:1911.09669·cs.LG·November 22, 2019

Regularizing Neural Networks by Stochastically Training Layer Ensembles

Alex Labach, Shahrokh Valaee

PDF

1 Repo

TL;DR

This paper introduces STE layers, a stochastic ensemble training method that improves neural network regularization by explicitly averaging multiple weight matrices, leading to better image classification performance without extra test-time cost.

Contribution

The paper proposes STE layers, a novel stochastic ensemble training approach that enhances regularization and model averaging in neural networks.

Findings

01

Consistent improvement on image classification tasks.

02

No additional computational cost during testing.

03

Enhanced regularization compared to traditional methods.

Abstract

Dropout and similar stochastic neural network regularization methods are often interpreted as implicitly averaging over a large ensemble of models. We propose STE (stochastically trained ensemble) layers, which enhance the averaging properties of such methods by training an ensemble of weight matrices with stochastic regularization while explicitly averaging outputs. This provides stronger regularization with no additional computational cost at test time. We show consistent improvement on various image classification tasks using standard network topologies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

j201/keras-ste-layers
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest