Label optimal regret bounds for online local learning
Pranjal Awasthi, Moses Charikar, Kevin A. Lai, Andrej Risteski

TL;DR
This paper determines the optimal regret bounds for online local learning, closing the gap between existing algorithms and theoretical limits, and establishes computational hardness results based on planted clique and dense subgraph problems.
Contribution
It provides a tighter analysis showing the semi-definite programming algorithm achieves optimal regret and proves a matching computational lower bound based on planted clique and dense subgraph conjectures.
Findings
SDP-based algorithm achieves regret of O(√nLT)
Matching computational lower bounds are established
Hardness results relate to planted clique and dense subgraph problems
Abstract
We resolve an open question from (Christiano, 2014b) posed in COLT'14 regarding the optimal dependency of the regret achievable for online local learning on the size of the label set. In this framework the algorithm is shown a pair of items at each step, chosen from a set of items. The learner then predicts a label for each item, from a label set of size and receives a real valued payoff. This is a natural framework which captures many interesting scenarios such as collaborative filtering, online gambling, and online max cut among others. (Christiano, 2014a) designed an efficient online learning algorithm for this problem achieving a regret of , where is the number of rounds. Information theoretically, one can achieve a regret of . One of the main open questions left in this framework concerns closing the above gap. In this work, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Complexity and Algorithms in Graphs
