Object-Aware Cropping for Self-Supervised Learning
Shlok Mishra, Anshul Shah, Ankan Bansal, Abhyuday Jagannatha, Janit, Anjaria, Abhishek Sharma, David Jacobs, Dilip Krishnan

TL;DR
This paper introduces object-aware cropping using object proposals to improve self-supervised learning on datasets with multiple small objects, leading to significant performance gains.
Contribution
It proposes replacing random crops with object proposal-based crops in self-supervised learning, enhancing object and scene understanding in uncurated datasets.
Findings
8.8% mAP improvement on OpenImages
Significant gains on COCO and PASCAL-VOC detection tasks
Applicable to various self-supervised frameworks
Abstract
A core component of the recent success of self-supervised learning is cropping data augmentation, which selects sub-regions of an image to be used as positive views in the self-supervised loss. The underlying assumption is that randomly cropped and resized regions of a given image share information about the objects of interest, which the learned representation will capture. This assumption is mostly satisfied in datasets such as ImageNet where there is a large, centered object, which is highly likely to be present in random crops of the full image. However, in other datasets such as OpenImages or COCO, which are more representative of real world uncurated data, there are typically multiple small objects in an image. In this work, we show that self-supervised learning based on the usual random cropping performs poorly on such datasets. We propose replacing one or both of the random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
