TL;DR
This paper introduces a modified contrastive learning approach with domain randomization for robotic manipulation, resulting in more robust visual features that transfer better from simulation to real-world scenarios.
Contribution
It proposes a simple modification to contrastive loss that improves invariance to irrelevant visual properties during unsupervised feature learning.
Findings
Features are more robust to visual domain variations.
Improved transferability of learned features to real-world tasks.
Effective for both rigid and non-rigid objects.
Abstract
Robotic tasks such as manipulation with visual inputs require image features that capture the physical properties of the scene, e.g., the position and configuration of objects. Recently, it has been suggested to learn such features in an unsupervised manner from simulated, self-supervised, robot interaction; the idea being that high-level physical properties are well captured by modern physical simulators, and their representation from visual inputs may transfer well to the real world. In particular, learning methods based on noise contrastive estimation have shown promising results. To robustify the simulation-to-real transfer, domain randomization (DR) was suggested for learning features that are invariant to irrelevant visual properties such as textures or lighting. In this work, however, we show that a naive application of DR to unsupervised learning based on contrastive estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
