Active Bias: Training More Accurate Neural Networks by Emphasizing High   Variance Samples

Haw-Shiuan Chang; Erik Learned-Miller; Andrew McCallum

arXiv:1704.07433·stat.ML·January 9, 2018·130 cites

Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples

Haw-Shiuan Chang, Erik Learned-Miller, Andrew McCallum

PDF

Open Access 1 Repo

TL;DR

This paper introduces methods to improve neural network training by emphasizing high-variance samples, leading to more accurate models across diverse architectures and training techniques.

Contribution

It proposes lightweight, uncertainty-based sample weighting strategies that enhance training effectiveness beyond existing methods.

Findings

01

Improved accuracy across six datasets and multiple architectures.

02

Additional gains when combined with popular training techniques.

03

Reliable enhancement of model performance through variance-based sampling.

Abstract

Self-paced learning and hard example mining re-weight training instances to improve learning accuracy. This paper presents two improved alternatives based on lightweight estimates of sample uncertainty in stochastic gradient descent (SGD): the variance in predicted probability of the correct class across iterations of mini-batch SGD, and the proximity of the correct class probability to the decision threshold. Extensive experimental results on six datasets show that our methods reliably improve accuracy in various network architectures, including additional gains on top of other popular training techniques, such as residual learning, momentum, ADAM, batch normalization, dropout, and distillation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhangyuwangumass/Trajectory-Reweighted-Sampler
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning

MethodsStochastic Gradient Descent