Any-Subgroup Equivariant Networks via Symmetry Breaking
Abhinav Goel, Derek Lim, Hannah Lawrence, Stefanie Jegelka, Ningyuan Huang

TL;DR
This paper introduces a flexible neural network architecture, ASEN, capable of being equivariant to multiple symmetry groups simultaneously by modulating auxiliary inputs, enabling better generalization across diverse geometric data.
Contribution
The authors propose a novel approach to achieve subgroup equivariance in neural networks through symmetry-breaking inputs, relaxing the computational difficulty of exact symmetry enforcement.
Findings
ASEN outperforms separate equivariant models on graph and image tasks.
Single ASEN model achieves superior multitask and transfer learning results.
Theoretical guarantees of universality if the base model is universal.
Abstract
The inclusion of symmetries as an inductive bias, known as equivariance, often improves generalization on geometric data (e.g. grids, sets, and graphs). However, equivariant architectures are usually highly constrained, designed for symmetries chosen a priori, and not applicable to datasets with other symmetries. This precludes the development of flexible, multi-modal foundation models capable of processing diverse data equivariantly. In this work, we build a single model -- the Any-Subgroup Equivariant Network (ASEN) -- that can be simultaneously equivariant to several groups, simply by modulating a certain auxiliary input feature. In particular, we start with a fully permutation-equivariant base model, and then obtain subgroup equivariance by using a symmetry-breaking input whose automorphism group is that subgroup. However, finding an input with the desired automorphism group is…
Peer Reviews
Decision·ICLR 2026 Poster
S1. While there has been prior work on architecture agnostic invariance/equivariance and knowledge sharing/transfer across symmetries [1-3], they consider taking an unconstrained base model and adding symmetry constraints through (randomized) symmetry breaking, while this work considers an opposite and original direction of starting at an overly constrained base model and reducing symmetry constraints using a fixed symmetry-breaking input. This is an interesting direction and could facilitate fu
W1. There are several potential weaknesses stemming from the fact that the work takes an over-constrained base model and relaxes its constraints in downstream tasks. One weaknesses is related to practical implication of Theorem 2. For large groups G such as Sn as considered in this work, constructing universal and equivariant networks is in general quite hard and requires combinatorially large feature spaces [4] or some kind of randomized symmetry breaking themselves [5], making the setup of The
- The authors tackle a relevant problem: building foundation models equivariant with respect to adaptive symmetries. - The proposed solution, using symmetry breaking to realize adaptable group equivariance, is a brilliant idea.
1. Although the proposed solution is interesting, the analysis is broad but shallow. The paper aims to span both theory and practice, yet it defers the hardest, and most relevant, questions to future work, even though they are essential to validate the framework as a practical path to adaptive foundation models. In particular: - The limited analysis of scalability and computational cost is a critical omission, and should be of paramount importance in the design of foundation models. - A thorou
- Presents an interesting and elegant idea that is both simple and flexible for constructing subgroup-equivariant networks. - Provides a solid mathematical foundation, with clear and rigorous proofs of the main results. - Includes several illustrative examples that effectively clarify the concept and demonstrate its broad applicability. - Offers a unifying framework that can reproduce or extend many existing equivariant architectures with minimal effort.
Overall, I find the paper is convincing and I am not aware of any similar work, therefore the weaknesses below should be regarded as minor: - Computational efficiency and scalability are not analyzed, leaving open how well the approach performs for larger models or groups with respect to alternative approaches. - The experiments demonstrate flexibility, but not whether this is the preferred method compared to specialized architectures; it may mainly serve as a prototyping tool. - The experime
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topological and Geometric Data Analysis · Domain Adaptation and Few-Shot Learning
