Hyperparameter Learning for Conditional Kernel Mean Embeddings with Rademacher Complexity Bounds
Kelvin Hsu, Richard Nock, Fabio Ramos

TL;DR
This paper introduces a scalable hyperparameter learning framework for conditional kernel mean embeddings using Rademacher complexity bounds, improving over existing methods and enabling integration with deep neural networks.
Contribution
It proposes a novel hyperparameter learning method based on Rademacher complexity bounds that avoids costly cross validation and supports scalable, non-approximate kernel tuning.
Findings
Outperforms existing hyperparameter tuning methods in experiments
Enables scalable kernel hyperparameter optimization without kernel approximations
Can be extended to learn deep neural network weights for better generalization
Abstract
Conditional kernel mean embeddings are nonparametric models that encode conditional expectations in a reproducing kernel Hilbert space. While they provide a flexible and powerful framework for probabilistic inference, their performance is highly dependent on the choice of kernel and regularization hyperparameters. Nevertheless, current hyperparameter tuning methods predominantly rely on expensive cross validation or heuristics that is not optimized for the inference task. For conditional kernel mean embeddings with categorical targets and arbitrary inputs, we propose a hyperparameter learning framework based on Rademacher complexity bounds to prevent overfitting by balancing data fit against model complexity. Our approach only requires batch updates, allowing scalable kernel hyperparameter tuning without invoking kernel approximations. Experiments demonstrate that our learning framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Generative Adversarial Networks and Image Synthesis
