On the robustness of self-supervised representations for multi-view   object classification

David Torpey; Richard Klein

arXiv:2208.00787·cs.CV·August 2, 2022

On the robustness of self-supervised representations for multi-view object classification

David Torpey, Richard Klein

PDF

TL;DR

This paper demonstrates that self-supervised representations, especially those based on instance discrimination, are more robust to viewpoint changes and complex scenes in multi-view object classification tasks compared to supervised methods.

Contribution

It shows that self-supervised pre-training yields more viewpoint-invariant object representations, improving robustness in multi-view classification scenarios.

Findings

01

Self-supervised methods outperform supervised baselines in viewpoint robustness.

02

Self-supervised representations encode more relevant object information.

03

Robustness tested through homographies and real-world multi-view datasets.

Abstract

It is known that representations from self-supervised pre-training can perform on par, and often better, on various downstream tasks than representations from fully-supervised pre-training. This has been shown in a host of settings such as generic object classification and detection, semantic segmentation, and image retrieval. However, some issues have recently come to the fore that demonstrate some of the failure modes of self-supervised representations, such as performance on non-ImageNet-like data, or complex scenes. In this paper, we show that self-supervised representations based on the instance discrimination objective lead to better representations of objects that are more robust to changes in the viewpoint and perspective of the object. We perform experiments of modern self-supervised methods against multiple supervised baselines to demonstrate this, including approximating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.