On the robustness of self-supervised representations for multi-view object classification
David Torpey, Richard Klein

TL;DR
This paper demonstrates that self-supervised representations, especially those based on instance discrimination, are more robust to viewpoint changes and complex scenes in multi-view object classification tasks compared to supervised methods.
Contribution
It shows that self-supervised pre-training yields more viewpoint-invariant object representations, improving robustness in multi-view classification scenarios.
Findings
Self-supervised methods outperform supervised baselines in viewpoint robustness.
Self-supervised representations encode more relevant object information.
Robustness tested through homographies and real-world multi-view datasets.
Abstract
It is known that representations from self-supervised pre-training can perform on par, and often better, on various downstream tasks than representations from fully-supervised pre-training. This has been shown in a host of settings such as generic object classification and detection, semantic segmentation, and image retrieval. However, some issues have recently come to the fore that demonstrate some of the failure modes of self-supervised representations, such as performance on non-ImageNet-like data, or complex scenes. In this paper, we show that self-supervised representations based on the instance discrimination objective lead to better representations of objects that are more robust to changes in the viewpoint and perspective of the object. We perform experiments of modern self-supervised methods against multiple supervised baselines to demonstrate this, including approximating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
