Batch-Expansion Training: An Efficient Optimization Framework

Micha{\l} Derezi\'nski; Dhruv Mahajan; S. Sathiya Keerthi; S.; V. N. Vishwanathan; Markus Weimer

arXiv:1704.06731·cs.LG·February 26, 2018·1 cites

Batch-Expansion Training: An Efficient Optimization Framework

Micha{\l} Derezi\'nski, Dhruv Mahajan, S. Sathiya Keerthi, S., V. N. Vishwanathan, Markus Weimer

PDF

Open Access

TL;DR

Batch-Expansion Training (BET) is a resource-efficient optimization framework that gradually increases batch size, achieving optimal convergence rates and outperforming traditional stochastic and batch methods in distributed settings.

Contribution

BET introduces a novel batch expansion approach that improves efficiency and convergence without requiring resampling or parameter tuning.

Findings

01

BET achieves $O(1/\epsilon)$ convergence rate for strongly convex objectives.

02

BET outperforms standard batch and stochastic methods in distributed experiments.

03

Batch size growing exponentially enhances resource efficiency and convergence.

Abstract

We propose Batch-Expansion Training (BET), a framework for running a batch optimizer on a gradually expanding dataset. As opposed to stochastic approaches, batches do not need to be resampled i.i.d. at every iteration, thus making BET more resource efficient in a distributed setting, and when disk-access is constrained. Moreover, BET can be easily paired with most batch optimizers, does not require any parameter-tuning, and compares favorably to existing stochastic and batch methods. We show that when the batch size grows exponentially with the number of outer iterations, BET achieves optimal $O (1/ ϵ)$ data-access convergence rate for strongly convex objectives. Experiments in parallel and distributed settings show that BET performs better than standard batch and stochastic approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Machine Learning and Data Classification