Relative Loss Bounds for On-line Density Estimation with the Exponential   Family of Distributions

Katy S. Azoury; Manfred K. Warmuth

arXiv:1301.6677·cs.LG·January 30, 2013·29 cites

Relative Loss Bounds for On-line Density Estimation with the Exponential Family of Distributions

Katy S. Azoury, Manfred K. Warmuth

PDF

Open Access

TL;DR

This paper establishes bounds on the additional loss incurred by an online density estimation algorithm using exponential family distributions, compared to an offline optimal, for arbitrary data sequences.

Contribution

It introduces a divergence-based analysis to derive and bound the relative loss of online algorithms in exponential family density estimation.

Findings

01

Derived bounds on online vs. offline loss differences

02

Applicable to arbitrary data sequences

03

Uses a divergence based on relative entropy between exponential distributions

Abstract

We consider on-line density estimation with a parameterized density from the exponential family. The on-line algorithm receives one example at a time and maintains a parameter that is essentially an average of the past examples. After receiving an example the algorithm incurs a loss which is the negative log-likelihood of the example w.r.t. the past parameter of the algorithm. An off-line algorithm can choose the best parameter based on all the examples. We prove bounds on the additional total loss of the on-line algorithm over the total loss of the off-line algorithm. These relative loss bounds hold for an arbitrary sequence of examples. The goal is to design algorithms with the best possible relative loss bounds. We use a certain divergence to derive and analyze the algorithms. This divergence is a relative entropy between two exponential distributions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Statistical Methods and Inference