Informational Confidence Bounds for Self-Normalized Averages and Applications

Aurélien Garivier

arXiv:1309.3376·math.ST·November 17, 2016·ITW

TL;DR

This paper derives deviation bounds for self-normalized averages using a peeling argument within exponential martingale techniques, and applies them to estimation with a random number of observations, with bandit problems and context tree estimation as the motivating examples.

Contribution

The paper presents a peeling argument as an alternative to the method of mixture for deriving deviation bounds on self-normalized averages, aimed at estimation problems where the number of observations is random.

Findings

1. Provides new deviation bounds for self-normalized averages.

2. Demonstrates applications in bandit problems and context tree estimation.

3. Offers an alternative to existing methods such as the method of mixture.

Abstract

We present deviation bounds for self-normalized averages and applications to estimation with a random number of observations. The results rely on a peeling argument in exponential martingale techniques that represents an alternative to the method of mixture. The motivating examples of bandit problems and context tree estimation are detailed.

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Distributed Sensor Networks and Detection Algorithms