Hyperparameter Learning for Conditional Kernel Mean Embeddings with   Rademacher Complexity Bounds

Kelvin Hsu; Richard Nock; Fabio Ramos

arXiv:1809.00175·stat.ML·November 9, 2018

Hyperparameter Learning for Conditional Kernel Mean Embeddings with Rademacher Complexity Bounds

Kelvin Hsu, Richard Nock, Fabio Ramos

PDF

Open Access 1 Repo

TL;DR

This paper introduces a scalable hyperparameter learning framework for conditional kernel mean embeddings using Rademacher complexity bounds, improving over existing methods and enabling integration with deep neural networks.

Contribution

It proposes a novel hyperparameter learning method based on Rademacher complexity bounds that avoids costly cross validation and supports scalable, non-approximate kernel tuning.

Findings

01

Outperforms existing hyperparameter tuning methods in experiments

02

Enables scalable kernel hyperparameter optimization without kernel approximations

03

Can be extended to learn deep neural network weights for better generalization

Abstract

Conditional kernel mean embeddings are nonparametric models that encode conditional expectations in a reproducing kernel Hilbert space. While they provide a flexible and powerful framework for probabilistic inference, their performance is highly dependent on the choice of kernel and regularization hyperparameters. Nevertheless, current hyperparameter tuning methods predominantly rely on expensive cross validation or heuristics that is not optimized for the inference task. For conditional kernel mean embeddings with categorical targets and arbitrary inputs, we propose a hyperparameter learning framework based on Rademacher complexity bounds to prevent overfitting by balancing data fit against model complexity. Our approach only requires batch updates, allowing scalable kernel hyperparameter tuning without invoking kernel approximations. Experiments demonstrate that our learning framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Kelvin-Hsu/cake
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Generative Adversarial Networks and Image Synthesis