Convergence Analyses of Online ADAM Algorithm in Convex Setting and Two-Layer ReLU Neural Network
Biyi Fang, Diego Klabjan

TL;DR
This paper introduces a new regret metric for online streaming learning, analyzes the convergence of online ADAM algorithms in convex and neural network settings, and demonstrates their effectiveness through theoretical bounds and experiments.
Contribution
It proposes a novel regret metric for streaming data, provides rigorous convergence analysis of online ADAM algorithms in convex and neural network contexts, and validates the algorithms with experiments.
Findings
Regret bounds of order square root of window size for convex settings.
First analysis of regret order in standard online setting with flawed previous proofs.
Neural network analysis shows similar regret bounds when initial weights are near stationary points.
Abstract
Nowadays, online learning is an appealing learning paradigm, which is of great interest in practice due to the recent emergence of large scale applications such as online advertising placement and online web ranking. Standard online learning assumes a finite number of samples while in practice data is streamed infinitely. In such a setting gradient descent with a diminishing learning rate does not work. We first introduce regret with rolling window, a new performance metric for online streaming learning, which measures the performance of an algorithm on every fixed number of contiguous samples. At the same time, we propose a family of algorithms based on gradient descent with a constant or adaptive learning rate and provide very technical analyses establishing regret bound properties of the algorithms. We cover the convex setting showing the regret of the order of the square root of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Advanced Bandit Algorithms Research · Face and Expression Recognition
Methods*Communicated@Fast*How Do I Communicate to Expedia?
