Self-supervised Transformation Learning for Equivariant Representations
Jaemyung Yu, Jaehyun Choi, Dong-Jae Lee, HyeongGwon Hong, Junmo Kim

TL;DR
This paper introduces Self-supervised Transformation Learning (STL), a novel method that learns equivariant representations without relying on transformation labels, improving performance across diverse vision tasks and handling complex transformations.
Contribution
STL replaces transformation labels with learned representations, enabling effective equivariant learning without increased batch complexity or dependency on transformation labels.
Findings
Outperforms existing methods in 7 out of 11 benchmarks
Effective with complex transformations like AugMix
Compatible with various base models
Abstract
Unsupervised representation learning has significantly advanced various machine learning tasks. In the computer vision domain, state-of-the-art approaches utilize transformations like random crop and color jitter to achieve invariant representations, embedding semantically the same inputs despite transformations. However, this can degrade performance in tasks requiring precise features, such as localization or flower classification. To address this, recent research incorporates equivariant representation learning, which captures transformation-sensitive information. However, current methods depend on transformation labels and thus struggle with interdependency and complex transformations. We propose Self-supervised Transformation Learning (STL), replacing transformation labels with transformation representations derived from image pairs. The proposed method ensures transformation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsAugMix · Balanced Selection
