Improving Pixel-Level Contrastive Learning by Leveraging Exogenous Depth Information
Ahmed Ben Saad, Kristina Prokopetc, Josselin Kherroubi, Axel Davy,, Adrien Courtois, Gabriele Facciolo

TL;DR
This paper enhances pixel-level contrastive learning by integrating exogenous depth information, leading to improved object shape representation and better performance on segmentation tasks across various datasets.
Contribution
The authors introduce a method that incorporates depth data into contrastive learning, improving pixel-level representations and proposing a multi-scale loss to handle different object sizes.
Findings
Achieved 1.9% improvement over PixPro on Borehole Images
Improved indoor segmentation results on ScanNet by 1.6%
Enhanced outdoor scene segmentation on CityScapes by 1.1%
Abstract
Self-supervised representation learning based on Contrastive Learning (CL) has been the subject of much attention in recent years. This is due to the excellent results obtained on a variety of subsequent tasks (in particular classification), without requiring a large amount of labeled samples. However, most reference CL algorithms (such as SimCLR and MoCo, but also BYOL and Barlow Twins) are not adapted to pixel-level downstream tasks. One existing solution known as PixPro proposes a pixel-level approach that is based on filtering of pairs of positive/negative image crops of the same image using the distance between the crops in the whole image. We argue that this idea can be further enhanced by incorporating semantic information provided by exogenous data as an additional selection filter, which can be used (at training time) to improve the selection of the pixel-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Improving Pixel-Level Contrastive Learning by Leveraging Exogenous Depth Information· youtube
Taxonomy
TopicsImage Processing Techniques and Applications · Domain Adaptation and Few-Shot Learning · Geophysical Methods and Applications
MethodsBitcoin Customer Service Number +1-833-534-1729 · 1x1 Convolution · Dense Connections · Residual Connection · Max Pooling · Average Pooling · Residual Block · Color Jitter · Bottleneck Residual Block · Normalized Temperature-scaled Cross Entropy Loss
