Universal Batch Learning Under The Misspecification Setting

Shlomi Vituri; Meir Feder

arXiv:2405.07252·cs.LG·June 25, 2024

Universal Batch Learning Under The Misspecification Setting

Shlomi Vituri, Meir Feder

PDF

Open Access

TL;DR

This paper investigates universal batch learning under model misspecification with log-loss, deriving optimal strategies and bounds that depend on hypothesis class complexity rather than data distribution complexity.

Contribution

It introduces a minimax optimal universal learner as a mixture over data-generating distributions and provides a closed-form expression for the min-max regret.

Findings

01

Derived the optimal universal learner using minimax and information theory.

02

Established tight bounds showing complexity depends on hypothesis class richness.

03

Extended Arimoto-Blahut algorithm for numerical evaluation of regret.

Abstract

In this paper we consider the problem of universal {\em batch} learning in a misspecification setting with log-loss. In this setting the hypothesis class is a set of models $Θ$ . However, the data is generated by an unknown distribution that may not belong to this set but comes from a larger set of models $Φ \supset Θ$ . Given a training sample, a universal learner is requested to predict a probability distribution for the next outcome and a log-loss is incurred. The universal learner performance is measured by the regret relative to the best hypothesis matching the data, chosen from $Θ$ . Utilizing the minimax theorem and information theoretical tools, we derive the optimal universal learner, a mixture over the set of the data generating distributions, and get a closed form expression for the min-max regret. We show that this regret can be considered as a constrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms

MethodsSparse Evolutionary Training