Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy
Tong Zhang, Congpei Qiu, Wei Ke, Sabine S\"usstrunk, Mathieu Salzmann

TL;DR
This paper introduces LoGo, a self-supervised learning strategy that explicitly models local and global image crops to improve representation diversity and transfer learning performance, outperforming existing methods.
Contribution
LoGo is a novel SSL approach that encourages similarity between global and local crops while promoting dissimilarity among local crops, enhancing representation diversity.
Findings
Outperforms existing SSL methods on various datasets.
Achieves better transfer learning results than supervised models with less data.
Can be integrated into existing SSL frameworks easily.
Abstract
Self-supervised learning (SSL) methods aim to learn view-invariant representations by maximizing the similarity between the features extracted from different crops of the same image regardless of cropping size and content. In essence, this strategy ignores the fact that two crops may truly contain different image information, e.g., background and small objects, and thus tends to restrain the diversity of the learned representations. In this work, we address this issue by introducing a new self-supervised learning strategy, LoGo, that explicitly reasons about Local and Global crops. To achieve view invariance, LoGo encourages similarity between global crops from the same image, as well as between a global and a local crop. However, to correctly encode the fact that the content of smaller crops may differ entirely, LoGo promotes two local crops to have dissimilar representations, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification
