A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models

Sebastian G. Gruber; Florian Buettner

arXiv:2310.05833·cs.LG·July 11, 2024

TL;DR

This paper introduces a novel bias-variance-covariance decomposition for kernel scores, providing a theoretical framework for uncertainty estimation in generative models across various modalities, with practical estimators and improved predictive capabilities.

Contribution

The paper presents the first bias-variance-covariance decomposition of kernel scores, enabling model-agnostic uncertainty estimation for generative models with unbiased and consistent estimators.

Findings

01  Kernel entropy outperforms baselines in predicting model performance.

02  Framework applies to image, audio, and language generation.

03  Estimators require only generated samples, not the underlying models.

Abstract

Generative models, like large language models, are becoming increasingly relevant in our daily lives, yet a theoretical framework to assess their generalization behavior and uncertainty does not exist. Particularly, the problem of uncertainty estimation is commonly solved in an ad-hoc and task-dependent manner. For example, natural language approaches cannot be transferred to image generation. In this paper, we introduce the first bias-variance-covariance decomposition for kernel scores. This decomposition represents a theoretical framework from which we derive a kernel-based variance and entropy for uncertainty estimation. We propose unbiased and consistent estimators for each quantity which only require generated samples but not the underlying model itself. Based on the wide applicability of kernels, we demonstrate our framework via generalization and uncertainty experiments for…
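
The idea that an uncertainty estimate needs only generated samples can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on assumptions not stated on this page: that the kernel entropy is taken as the negative expected kernel value between two independent samples, that an RBF kernel is used, and that each generated output has already been mapped to a fixed-length vector (for example by an embedding model). The names `rbf_kernel`, `mean_offdiag_kernel`, `kernel_entropy_estimate`, and `gamma` are hypothetical.

```python
import numpy as np


def rbf_kernel(x, y, gamma=1.0):
    """Gaussian RBF kernel matrix between the rows of x and the rows of y."""
    sq_dists = (
        np.sum(x ** 2, axis=1)[:, None]
        + np.sum(y ** 2, axis=1)[None, :]
        - 2.0 * x @ y.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


def mean_offdiag_kernel(samples, gamma=1.0):
    """Unbiased U-statistic estimate of E[k(X, X')] for independent X, X'.

    Only the off-diagonal Gram entries are averaged, so no sample is
    compared with itself.
    """
    n = samples.shape[0]
    if n < 2:
        raise ValueError("need at least two samples")
    gram = rbf_kernel(samples, samples, gamma)
    return (gram.sum() - np.trace(gram)) / (n * (n - 1))


def kernel_entropy_estimate(samples, gamma=1.0):
    """Assumed definition: kernel entropy = -E[k(X, X')].

    Values near -1 mean the samples are nearly identical (low uncertainty);
    values near 0 mean they are spread far apart (high uncertainty).
    """
    return -mean_offdiag_kernel(samples, gamma)


# Hypothetical stand-ins for embedded generations of one prompt.
identical = np.zeros((5, 3))
spread = np.arange(5, dtype=float)[:, None] * 100.0 * np.ones((1, 3))

entropy_identical = kernel_entropy_estimate(identical)
entropy_spread = kernel_entropy_estimate(spread)
```

Under these assumptions, the estimate uses nothing but the sample matrix, so it works the same way for any modality that can be embedded into vectors. The choice of kernel and of `gamma` affects the scale of the result and would need to be tuned per task.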

Code & Models

Repositories

mlo-lab/bvcd_generative_models
PyTorch · Official

Taxonomy

Topics: Topic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques

Methods: Diffusion