A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models
Sebastian G. Gruber, Florian Buettner

TL;DR
This paper introduces a novel bias-variance-covariance decomposition for kernel scores, providing a theoretical framework for uncertainty estimation in generative models across various modalities, with practical estimators and improved predictive capabilities.
Contribution
It presents the first decomposition framework for kernel scores in generative models, enabling model-agnostic uncertainty estimation with unbiased estimators.
Findings
Kernel entropy outperforms baselines in predicting model performance.
Framework applies to image, audio, and language generation.
Estimators require only generated samples, not the underlying models.
Abstract
Generative models, like large language models, are becoming increasingly relevant in our daily lives, yet a theoretical framework to assess their generalization behavior and uncertainty does not exist. Particularly, the problem of uncertainty estimation is commonly solved in an ad-hoc and task-dependent manner. For example, natural language approaches cannot be transferred to image generation. In this paper, we introduce the first bias-variance-covariance decomposition for kernel scores. This decomposition represents a theoretical framework from which we derive a kernel-based variance and entropy for uncertainty estimation. We propose unbiased and consistent estimators for each quantity which only require generated samples but not the underlying model itself. Based on the wide applicability of kernels, we demonstrate our framework via generalization and uncertainty experiments for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques
MethodsDiffusion
