Regret Minimization with Noisy Observations
Mohammad Mahdian, Jieming Mao, Kangning Wang

TL;DR
This paper addresses the challenge of regret minimization in noisy observation settings without prior distribution knowledge, proposing an efficient algorithm that guarantees a constant approximation to the optimal regret.
Contribution
It introduces a novel regret minimization algorithm for noisy, adversarially chosen values that does not require prior distribution assumptions.
Findings
Naive approach of selecting the highest observed value performs poorly.
Proposed algorithm achieves a constant-factor approximation to optimal regret.
Algorithm is simple, efficient, and requires minimal noise distribution knowledge.
Abstract
In a typical optimization problem, the task is to pick one of a number of options with the lowest cost or the highest value. In practice, these cost/value quantities often come through processes such as measurement or machine learning, which are noisy, with quantifiable noise distributions. To take these noise distributions into account, one approach is to assume a prior for the values, use it to build a posterior, and then apply standard stochastic optimization to pick a solution. However, in many practical applications, such prior distributions may not be available. In this paper, we study such scenarios using a regret minimization model. In our model, the task is to pick the highest one out of values. The values are unknown and chosen by an adversary, but can be observed through noisy channels, where additive noises are stochastically drawn from known distributions. The goal is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Distributed Sensor Networks and Detection Algorithms
