Online Action Learning in High Dimensions: A Conservative Perspective
Claudio Cardoso Flores, Marcelo Cunha Medeiros

TL;DR
This paper extends the $ ext{ extepsilon}_t$-greedy algorithm for high-dimensional sequential learning by incorporating a conservative approach that restricts exploration to promising actions, providing theoretical bounds and practical flexibility.
Contribution
It introduces a conservative high-dimensional $ ext{ extepsilon}_t$-greedy rule with theoretical regret bounds and practical tunability, improving upon non-conservative methods.
Findings
Bounded cumulative regret with high probability.
Lower bound on viable action set size improves regret bounds.
Flexible safety tuning without affecting theoretical guarantees.
Abstract
Sequential learning problems are common in several fields of research and practical applications. Examples include dynamic pricing and assortment, design of auctions and incentives and permeate a large number of sequential treatment experiments. In this paper, we extend one of the most popular learning solutions, the -greedy heuristics, to high-dimensional contexts considering a conservative directive. We do this by allocating part of the time the original rule uses to adopt completely new actions to a more focused search in a restrictive set of promising actions. The resulting rule might be useful for practical applications that still values surprises, although at a decreasing rate, while also has restrictions on the adoption of unusual actions. With high probability, we find reasonable bounds for the cumulative regret of a conservative high-dimensional decaying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Auction Theory and Applications
