Greedy Step Averaging: A parameter-free stochastic optimization method

Xiatian Zhang; Fan Yao; Yongjun Tian

arXiv:1611.03608 · cs.LG · November 14, 2016

TL;DR

The paper introduces GSA, a parameter-free stochastic optimization algorithm that adaptively determines learning rates without hyperparameter tuning, demonstrating robustness across multiple datasets.

Contribution

GSA is a novel gradient-based method that eliminates the need for manual learning rate tuning, simplifying the optimization process in machine learning.

Findings

01 GSA performs comparably to or better than state-of-the-art methods on 16 datasets.

02 GSA is robust across various machine learning scenarios.

03 GSA requires no hyperparameter tuning and adds no extra computational cost.

Abstract

In this paper we present the greedy step averaging (GSA) method, a parameter-free stochastic optimization algorithm for a variety of machine learning problems. As a gradient-based optimization method, GSA uses information from the minimizer of a single sample's loss function and applies an averaging strategy to compute a reasonable learning rate sequence. While most existing gradient-based algorithms introduce an increasing number of hyperparameters or trade off computational cost against convergence rate, GSA avoids manual tuning of the learning rate and introduces no additional hyperparameters or extra cost. We perform exhaustive numerical experiments on logistic and softmax regression to compare our method with other state-of-the-art methods on 16 datasets. The results show that GSA is robust in various scenarios.
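
The abstract names two ingredients: a step derived from the minimizer of a single sample's loss, and an average over such steps that serves as the learning rate. The sketch below is one possible reading of that idea, not the paper's update rule. It swaps in squared loss (the paper evaluates logistic and softmax regression) because the per-sample minimizing step then has the closed form 1 / ||x_i||^2. The function name `gsa_sketch`, its parameters, and the synthetic data are all hypothetical; the data is built with squared feature norms in [1, 2] so that the averaged step stays stable in this toy setting.

```python
import numpy as np


def gsa_sketch(X, y, epochs=5, seed=0):
    """Illustrative averaged-greedy-step SGD on squared loss.

    Assumption: this is a reading of the abstract, not the paper's exact rule.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    avg_step = 0.0  # running mean of all greedy steps seen so far
    seen = 0
    for _ in range(epochs):
        for i in rng.permutation(n_samples):
            x_i = X[i]
            sq_norm = float(x_i @ x_i)
            if sq_norm == 0.0:
                continue  # an all-zero sample carries no gradient
            residual = float(x_i @ w - y[i])
            grad = residual * x_i  # gradient of 0.5 * (x_i . w - y_i)^2
            # Greedy step: the step along -grad that drives this one
            # sample's squared loss to its minimum (zero residual).
            greedy_step = 1.0 / sq_norm
            seen += 1
            avg_step += (greedy_step - avg_step) / seen
            # The averaged step, not the greedy one, is used as the learning rate.
            w -= avg_step * grad
    return w, avg_step


# Hypothetical noiseless data with squared row norms in [1, 2].
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 3))
X /= np.linalg.norm(X, axis=1, keepdims=True)
X *= np.sqrt(rng.uniform(1.0, 2.0, size=(200, 1)))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w

w_hat, step = gsa_sketch(X, y)
print(w_hat, step)
```

No learning rate is passed in: the only inputs are the data, the number of passes, and a shuffling seed. The averaged step is computed entirely from the samples, which is the sense of "parameter-free" that this sketch tries to illustrate.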

Code & Models

Repositories

TalkingData/Fregata (official)

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques

Methods: Softmax