Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance
Jinwoo Kim, Tien Dat Nguyen, Ayhan Suleymanzade, Hyeokjun An,, Seunghoon Hong

TL;DR
This paper introduces a probabilistic symmetrization framework that enables arbitrary base models to learn equivariant functions across diverse symmetry groups, improving flexibility and performance over traditional equivariant architectures.
Contribution
It proposes a novel probabilistic symmetrization method that allows non-equivariant models to learn equivariant functions, enhancing versatility and reducing sample complexity.
Findings
Competitive results against specialized equivariant architectures
Effective on various symmetry groups including permutation and Euclidean groups
Improved learning in symmetric modalities like graphs
Abstract
We present a novel framework to overcome the limitations of equivariant architectures in learning functions with group symmetries. In contrary to equivariant architectures, we use an arbitrary base model such as an MLP or a transformer and symmetrize it to be equivariant to the given group by employing a small equivariant network that parameterizes the probabilistic distribution underlying the symmetrization. The distribution is end-to-end trained with the base model which can maximize performance while reducing sample complexity of symmetrization. We show that this approach ensures not only equivariance to given group but also universal approximation capability in expectation. We implement our method on various base models, including patch-based transformers that can be initialized from pretrained vision transformers, and test them for a wide range of symmetry groups including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Bioinformatics · Fractal and DNA sequence analysis · Domain Adaptation and Few-Shot Learning
MethodsBalanced Selection
