Convergence Analyses of Online ADAM Algorithm in Convex Setting and   Two-Layer ReLU Neural Network

Biyi Fang; Diego Klabjan

arXiv:1905.09356·cs.LG·November 27, 2019·5 cites

Convergence Analyses of Online ADAM Algorithm in Convex Setting and Two-Layer ReLU Neural Network

Biyi Fang, Diego Klabjan

PDF

Open Access

TL;DR

This paper introduces a new regret metric for online streaming learning, analyzes the convergence of online ADAM algorithms in convex and neural network settings, and demonstrates their effectiveness through theoretical bounds and experiments.

Contribution

It proposes a novel regret metric for streaming data, provides rigorous convergence analysis of online ADAM algorithms in convex and neural network contexts, and validates the algorithms with experiments.

Findings

01

Regret bounds of order square root of window size for convex settings.

02

First analysis of regret order in standard online setting with flawed previous proofs.

03

Neural network analysis shows similar regret bounds when initial weights are near stationary points.

Abstract

Nowadays, online learning is an appealing learning paradigm, which is of great interest in practice due to the recent emergence of large scale applications such as online advertising placement and online web ranking. Standard online learning assumes a finite number of samples while in practice data is streamed infinitely. In such a setting gradient descent with a diminishing learning rate does not work. We first introduce regret with rolling window, a new performance metric for online streaming learning, which measures the performance of an algorithm on every fixed number of contiguous samples. At the same time, we propose a family of algorithms based on gradient descent with a constant or adaptive learning rate and provide very technical analyses establishing regret bound properties of the algorithms. We cover the convex setting showing the regret of the order of the square root of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Advanced Bandit Algorithms Research · Face and Expression Recognition

Methods*Communicated@Fast*How Do I Communicate to Expedia?