Near-Linear Time Algorithm with Near-Logarithmic Regret Per Switch for   Mixable/Exp-Concave Losses

Kaan Gokcesu; Hakan Gokcesu

arXiv:2109.13786·cs.LG·September 29, 2021·1 cites

Near-Linear Time Algorithm with Near-Logarithmic Regret Per Switch for Mixable/Exp-Concave Losses

Kaan Gokcesu, Hakan Gokcesu

PDF

Open Access

TL;DR

This paper introduces a near-linear time online learning algorithm for mixable and exp-concave losses that achieves near-logarithmic regret per switch in dynamic environments, improving computational efficiency.

Contribution

It presents the first near-logarithmic regret per switch algorithm with sub-polynomial complexity, advancing online learning in dynamic settings.

Findings

01

Achieves near-logarithmic regret per switch with sub-polynomial complexity.

02

Provides deterministic guarantees in individual sequence settings.

03

Extends the online mixture framework for dynamic environments.

Abstract

We investigate the problem of online learning, which has gained significant attention in recent years due to its applicability in a wide range of fields from machine learning to game theory. Specifically, we study the online optimization of mixable loss functions with logarithmic static regret in a dynamic environment. The best dynamic estimation sequence that we compete against is selected in hindsight with full observation of the loss functions and is allowed to select different optimal estimations in different time intervals (segments). We propose an online mixture framework that uses these static solvers as the base algorithm. We show that with the suitable selection of hyper-expert creations and weighting strategies, we can achieve logarithmic and squared logarithmic regret per switch in quadratic and linearithmic computational complexity, respectively. For the first time in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics