Augmentation Component Analysis: Modeling Similarity via the Augmentation Overlaps
Lu Han, Han-Jia Ye, De-Chuan Zhan

TL;DR
Augmentation Component Analysis (ACA) introduces a novel way to model sample similarity based on augmentation overlaps, improving embedding quality in self-supervised learning by efficiently capturing semantic relationships.
Contribution
ACA models augmentation distributions as sample descriptions, enabling efficient dimension reduction and similarity preservation without explicit augmentation feature estimation.
Findings
Achieves competitive results on multiple benchmarks.
Provides a theoretical link to PCA for similarity preservation.
Efficiently captures semantic similarity through augmentation overlaps.
Abstract
Self-supervised learning aims to learn a embedding space where semantically similar samples are close. Contrastive learning methods pull views of samples together and push different samples away, which utilizes semantic invariance of augmentation but ignores the relationship between samples. To better exploit the power of augmentation, we observe that semantically similar samples are more likely to have similar augmented views. Therefore, we can take the augmented views as a special description of a sample. In this paper, we model such a description as the augmentation distribution and we call it augmentation feature. The similarity in augmentation feature reflects how much the views of two samples overlap and is related to their semantical similarity. Without computational burdens to explicitly estimate values of the augmentation feature, we propose Augmentation Component Analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Face and Expression Recognition
MethodsPrincipal Components Analysis · Contrastive Learning
