Improved Canonicalization for Model Agnostic Equivariance

Siba Smarak Panigrahi; Arnab Kumar Mondal

arXiv:2405.14089 · cs.LG · November 18, 2024

TL;DR

This paper presents an optimization-based canonicalization method that uses contrastive learning to achieve architecture-agnostic equivariance. It outperforms existing canonicalization approaches on large pretrained models and speeds up the canonicalization process by up to 2 times.

Contribution

It introduces a flexible, contrastive learning-based canonicalization approach that enables architecture-agnostic equivariance without extensive retraining.

Findings

01. Outperforms existing canonicalization methods in accuracy.

02. Speeds up the canonicalization process by up to 2 times.

03. Effective for large pretrained models.

Abstract

This work introduces a novel approach to achieving architecture-agnostic equivariance in deep learning, particularly addressing the limitations of traditional layerwise equivariant architectures and the inefficiencies of the existing architecture-agnostic methods. Building equivariant models using traditional methods requires designing equivariant versions of existing models and training them from scratch, a process that is both impractical and resource-intensive. Canonicalization has emerged as a promising alternative for inducing equivariance without altering model architecture, but it suffers from the need for highly expressive and expensive equivariant networks to learn canonical orientations accurately. We propose a new optimization-based method that employs any non-equivariant network for canonicalization. Our method uses contrastive learning to efficiently learn a canonical…
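
To make the idea of optimization-based canonicalization concrete, here is a minimal sketch of the general technique, not the paper's implementation or the equiadapt API. It assumes the finite group of 90-degree rotations acting on a square grid, and it uses a hand-written weighted sum (the hypothetical `energy` function and `WEIGHTS` table) as a stand-in for a learned, non-equivariant scoring network. The contrastive training described in the abstract is omitted. The sketch also assumes the score has a unique minimum over each orbit; with ties, the canonical form would not be well defined.

```python
from typing import Callable, List

Grid = List[List[float]]


def rot90(x: Grid, k: int = 1) -> Grid:
    """Rotate a square grid counter-clockwise by k quarter turns."""
    for _ in range(k % 4):
        # Transposing and then reversing the row order is a
        # counter-clockwise quarter turn.
        x = [list(row) for row in zip(*x)][::-1]
    return x


# Hypothetical stand-in for a learned, non-equivariant scoring network:
# a fixed, asymmetric weight table (an assumption for illustration only).
WEIGHTS: Grid = [[1.0, 2.0], [3.0, 4.0]]


def energy(x: Grid) -> float:
    """Score a 2x2 grid; the function itself has no rotational symmetry."""
    return sum(w * v for w_row, x_row in zip(WEIGHTS, x)
               for w, v in zip(w_row, x_row))


def canonicalize(x: Grid, score: Callable[[Grid], float] = energy) -> Grid:
    """Return the rotation of x with the lowest score.

    Every rotated copy of x has the same orbit, so all copies map to the
    same canonical grid as long as the minimum is unique.
    """
    return min((rot90(x, k) for k in range(4)), key=score)


def invariant_predict(predictor: Callable[[Grid], float], x: Grid) -> float:
    """Wrap an arbitrary predictor so its output ignores input rotation."""
    return predictor(canonicalize(x))
```

Calling `invariant_predict` with any predictor, for example one that reads the top-left entry, gives the same output for an input and all of its rotations. The predictor itself is never modified, which is the sense in which canonicalization is architecture-agnostic.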

Code & Models

Repositories

arnab39/equiadapt
pytorch

Taxonomy

Topics: Fault Detection and Control Systems

Methods: Contrastive Learning