Less Regret via Online Conditioning

Matthew Streeter; H. Brendan McMahan

arXiv:1002.4862·cs.LG·February 26, 2010·24 cites

Less Regret via Online Conditioning

Matthew Streeter, H. Brendan McMahan

PDF

Open Access

TL;DR

This paper introduces an adaptive online gradient descent algorithm with per-coordinate learning rate adjustments, providing stronger regret bounds and competitive performance in large-scale machine learning tasks.

Contribution

It presents a novel online gradient descent method with diagonal preconditioning, improving regret bounds over standard approaches.

Findings

01

Stronger regret bounds than standard online gradient descent.

02

Competitive performance in large-scale machine learning experiments.

03

Effective per-coordinate learning rate adaptation.

Abstract

We analyze and evaluate an online gradient descent algorithm with adaptive per-coordinate adjustment of learning rates. Our algorithm can be thought of as an online version of batch gradient descent with a diagonal preconditioner. This approach leads to regret bounds that are stronger than those of standard online gradient descent for general online convex optimization problems. Experimentally, we show that our algorithm is competitive with state-of-the-art algorithms for large scale machine learning problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques