Improved Canonicalization for Model Agnostic Equivariance
Siba Smarak Panigrahi, Arnab Kumar Mondal

TL;DR
This paper presents an optimization-based canonicalization method using contrastive learning to achieve architecture-agnostic equivariance efficiently, outperforming existing approaches and doubling speed for large pretrained models.
Contribution
It introduces a flexible, contrastive learning-based canonicalization approach that enables architecture-agnostic equivariance without extensive retraining.
Findings
Outperforms existing canonicalization methods in accuracy.
Speeds up canonicalization process by up to 2 times.
Effective for large pretrained models.
Abstract
This work introduces a novel approach to achieving architecture-agnostic equivariance in deep learning, particularly addressing the limitations of traditional layerwise equivariant architectures and the inefficiencies of the existing architecture-agnostic methods. Building equivariant models using traditional methods requires designing equivariant versions of existing models and training them from scratch, a process that is both impractical and resource-intensive. Canonicalization has emerged as a promising alternative for inducing equivariance without altering model architecture, but it suffers from the need for highly expressive and expensive equivariant networks to learn canonical orientations accurately. We propose a new optimization-based method that employs any non-equivariant network for canonicalization. Our method uses contrastive learning to efficiently learn a canonical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
MethodsContrastive Learning
