Online Linear Optimization with Many Hints
Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit

TL;DR
This paper introduces an online linear optimization algorithm that leverages multiple hints per round, achieving logarithmic regret when hints correlate with costs, extending prior single-hint approaches.
Contribution
It develops a method to combine multiple hint-based algorithms, enabling improved regret bounds in online linear optimization with many hints.
Findings
Achieves logarithmic regret with multiple hints when a convex combination correlates with costs.
Extends previous work from a single hint to many hints.
Provides a novel technique for combining multiple OLO algorithms.
Abstract
We study an online linear optimization (OLO) problem in which the learner is provided access to "hint" vectors in each round prior to making a decision. In this setting, we devise an algorithm that obtains logarithmic regret whenever there exists a convex combination of the hints that has positive correlation with the cost vectors. This significantly extends prior work that considered only the case . To accomplish this, we develop a way to combine many arbitrary OLO algorithms to obtain regret only a logarithmically worse factor than the minimum regret of the original algorithms in hindsight; this result is of independent interest.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms
