TL;DR
This paper introduces a framework for creating soft equivariant models that allow adjustable equivariance levels, improving performance and reducing error across various vision tasks and architectures.
Contribution
It presents a general method to control equivariance in pre-trained models through weight projection, with theoretical guarantees and broad applicability.
Findings
Improves performance on ImageNet benchmark.
Reduces equivariance error in multiple tasks.
Applicable to architectures like ViT and ResNet.
Abstract
Equivariance is a fundamental property in computer vision models, yet strict equivariance is rarely satisfied in real-world data, which can limit a model's performance. Controlling the degree of equivariance is therefore desirable. We propose a general framework for constructing soft equivariant models by projecting the model weights into a designed subspace. The method applies to any pre-trained architecture and provides theoretical bounds on the induced equivariance error. Empirically, we demonstrate the effectiveness of our method on multiple pre-trained backbones, including ViT and ResNet, across image classification, semantic segmentation, and human-trajectory prediction tasks. Notably, our approach improves the performance while simultaneously reducing equivariance error on the competitive ImageNet benchmark.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ashiq24/softeq-dinov3-vitl16-pretrain-lvd1689m-ade-seg-c4-s0.8-sp0.8model· 185 dl· ♡ 1185 dl♡ 1
- 🤗ashiq24/softeq-dinov3-vitl16-pretrain-lvd1689m-ade-seg-c360-s0.1-sp0.8model· 94 dl· ♡ 194 dl♡ 1
- 🤗ashiq24/softeq-vit-base-patch16-224-voc-seg-c720-s0.90model· 146 dl146 dl
- 🤗ashiq24/softeq-dinov2-base-voc-seg-c180-s1.0-sp0.9model· 61 dl61 dl
- 🤗ashiq24/softeq-vit-base-patch16-224-voc-seg-c4-s0.9-sp0.9model· 32 dl32 dl
- 🤗ashiq24/softeq-dinov2-base-voc-seg-c4-s0.9-sp1.0model· 42 dl42 dl
- 🤗ashiq24/softeq-dinov2-base-ade20k-seg-c180-s0.8-sp1.0model· 30 dl30 dl
- 🤗ashiq24/softeq-dinov2-base-ade20k-seg-c4-s0.8-sp1.0model· 40 dl40 dl
- 🤗ashiq24/softeq-dinov3-vitl16-pretrain-lvd1689m-ade-seg-c4-s0.4-sp0.4model· 90 dl90 dl
- 🤗ashiq24/softeq-dinov3-vitl16-pretrain-lvd1689m-ade-seg-c4-s0.1-sp0.1model· 76 dl76 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
