Seeing All the Angles: Learning Multiview Manipulation Policies for   Contact-Rich Tasks from Demonstrations

Trevor Ablett; Yifan Zhai; Jonathan Kelly

arXiv:2104.13907·cs.RO·July 11, 2022

Seeing All the Angles: Learning Multiview Manipulation Policies for Contact-Rich Tasks from Demonstrations

Trevor Ablett, Yifan Zhai, Jonathan Kelly

PDF

1 Repo

TL;DR

This paper introduces a method for learning multiview visuomotor manipulation policies from demonstrations, enabling robots to perform contact-rich tasks from multiple viewpoints, both in simulation and real-world settings.

Contribution

It demonstrates that multiview policies can be learned via imitation learning from diverse viewpoints, improving robustness and generalization in robotic manipulation tasks.

Findings

01

Multiview policies perform well across various tasks and viewpoints.

02

Learning from multiview data does not reduce performance on fixed-view tasks.

03

Multiview policies implicitly learn spatially correlated visual features.

Abstract

Learned visuomotor policies have shown considerable success as an alternative to traditional, hand-crafted frameworks for robotic manipulation. Surprisingly, an extension of these methods to the multiview domain is relatively unexplored. A successful multiview policy could be deployed on a mobile manipulation platform, allowing the robot to complete a task regardless of its view of the scene. In this work, we demonstrate that a multiview policy can be found through imitation learning by collecting data from a variety of viewpoints. We illustrate the general applicability of the method by learning to complete several challenging multi-stage and contact-rich tasks, from numerous viewpoints, both in a simulated environment and on a real mobile manipulation platform. Furthermore, we analyze our policies to determine the benefits of learning from multiview data compared to learning with data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

utiasSTARS/multiview-manipulation
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.