Symmetries in PAC-Bayesian Learning

Armin Beck; Peter Ochs

arXiv:2510.17303·cs.LG·October 21, 2025

Open Access

TL;DR

This paper extends PAC-Bayesian generalization guarantees to non-compact symmetries and non-invariant data distributions, providing theoretical insights and empirical validation for symmetric models in broader settings.

Contribution

The paper broadens the theory of symmetries in PAC-Bayesian learning to cover non-compact groups and non-invariant data distributions, tightens existing bounds, and validates them experimentally.

Findings

01. Guarantees hold for non-compact symmetries like translations.

02. Empirical results on rotated MNIST support theoretical claims.

03. Symmetric models outperform non-symmetric ones in experiments.

Abstract

Symmetries are known to improve the empirical performance of machine learning models, yet theoretical guarantees explaining these gains remain limited. Prior work has focused mainly on compact group symmetries and often assumes that the data distribution itself is invariant, an assumption rarely satisfied in real-world applications. In this work, we extend generalization guarantees to the broader setting of non-compact symmetries, such as translations, and to non-invariant data distributions. Building on the PAC-Bayes framework, we adapt and tighten existing bounds, demonstrating the approach on McAllester's PAC-Bayes bound while showing that it applies to a wide range of PAC-Bayes bounds. We validate our theory with experiments on a rotated MNIST dataset with a non-uniform rotation group, where the derived guarantees not only hold but also improve upon prior results. These findings…
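For readers unfamiliar with the bound named in the abstract, the following is a minimal sketch of a commonly stated form of McAllester's PAC-Bayes bound: with probability at least 1 - delta over an i.i.d. sample of size n, the expected risk of a posterior is at most its empirical risk plus sqrt((KL(posterior || prior) + ln(2 sqrt(n) / delta)) / (2n)). This is the textbook form for losses in [0, 1], not necessarily the exact variant the paper adapts; the function name `mcallester_bound` and its parameters are illustrative assumptions, and the symmetry-aware tightening described in the abstract is not reproduced here.

```python
import math


def mcallester_bound(empirical_risk: float, kl: float, n: int, delta: float) -> float:
    """Upper bound on expected risk under a commonly stated McAllester-style bound.

    Assumptions (not taken from the paper): loss values lie in [0, 1],
    the n samples are i.i.d., and `kl` is KL(posterior || prior) in nats.
    """
    if n <= 0:
        raise ValueError("n must be a positive sample size")
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie strictly between 0 and 1")
    if kl < 0.0:
        raise ValueError("KL divergence cannot be negative")
    if not 0.0 <= empirical_risk <= 1.0:
        raise ValueError("empirical_risk must lie in [0, 1]")

    # Complexity term: grows with the KL divergence, shrinks with sample size.
    complexity = math.sqrt((kl + math.log(2.0 * math.sqrt(n) / delta)) / (2.0 * n))

    # A risk for a [0, 1]-valued loss never exceeds 1, so clip the bound there.
    return min(1.0, empirical_risk + complexity)


if __name__ == "__main__":
    # Hypothetical numbers, chosen only to show how the bound reacts to KL and n.
    for kl_value in (5.0, 50.0):
        for sample_size in (1_000, 100_000):
            bound = mcallester_bound(0.1, kl_value, sample_size, delta=0.05)
            print(f"KL={kl_value:5.1f}  n={sample_size:6d}  bound={bound:.4f}")
```

In a bound of this form, a smaller KL term gives a tighter guarantee at a fixed sample size, which is why the choice of prior and posterior matters.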


Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Bayesian Methods and Mixture Models