Online Linear Optimization with Many Hints

Aditya Bhaskara; Ashok Cutkosky; Ravi Kumar; Manish Purohit

arXiv:2010.03082·cs.LG·October 8, 2020

Online Linear Optimization with Many Hints

Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit

PDF

Open Access 1 Video

TL;DR

This paper introduces an online linear optimization algorithm that leverages multiple hints per round, achieving logarithmic regret when hints correlate with costs, extending prior single-hint approaches.

Contribution

It develops a method to combine multiple hint-based algorithms, enabling improved regret bounds in online linear optimization with many hints.

Findings

01

Achieves logarithmic regret with multiple hints when a convex combination correlates with costs.

02

Extends previous work from a single hint to many hints.

03

Provides a novel technique for combining multiple OLO algorithms.

Abstract

We study an online linear optimization (OLO) problem in which the learner is provided access to $K$ "hint" vectors in each round prior to making a decision. In this setting, we devise an algorithm that obtains logarithmic regret whenever there exists a convex combination of the $K$ hints that has positive correlation with the cost vectors. This significantly extends prior work that considered only the case $K = 1$ . To accomplish this, we develop a way to combine many arbitrary OLO algorithms to obtain regret only a logarithmically worse factor than the minimum regret of the original algorithms in hindsight; this result is of independent interest.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Online Linear Optimization with Many Hints· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms