A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation

Bernardo Aceituno; Alberto Rodriguez; Shubham Tulsiani; Abhinav Gupta; Mustafa Mukadam

arXiv:2111.05318·cs.RO·November 10, 2021

TL;DR

This paper introduces a novel differentiable architecture called DLM that combines neural video decoding with contact mechanics priors to improve learning of visual non-prehensile planar manipulation tasks, especially on unseen objects.

Contribution

The work presents a new modular, fully differentiable architecture that integrates deep learning with contact mechanics for robot manipulation from videos, advancing beyond existing methods.

Findings

01. DLM outperforms learning-only methods on unseen objects and motions.

02. DLM effectively combines neural models with contact mechanics priors.

03. The experiments demonstrate the benefits of differentiable optimization and simulation in manipulation learning.

Abstract

Specifying tasks with videos is a powerful technique towards acquiring novel and general robot skills. However, reasoning over mechanics and dexterous interactions can make it challenging to scale learning contact-rich manipulation. In this work, we focus on the problem of visual non-prehensile planar manipulation: given a video of an object in planar motion, find contact-aware robot actions that reproduce the same object motion. We propose a novel architecture, Differentiable Learning for Manipulation (DLM), that combines video decoding neural models with priors from contact mechanics by leveraging differentiable optimization and finite-difference-based simulation. Through extensive simulated experiments, we investigate the interplay between traditional model-based techniques and modern deep learning approaches. We find that our modular and fully differentiable architecture performs…
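As a rough illustration of what differentiating through a simulator with finite differences can look like, the Python sketch below estimates gradients of a tracking loss through a black-box simulator and uses them to fit a push action. It is not the authors' implementation: `simulate_push`, its made-up dynamics, `fit_action`, and all step sizes are hypothetical names and values chosen only for this example.

```python
from typing import Callable, List


def simulate_push(action: List[float]) -> List[float]:
    """Hypothetical toy simulator: maps a 2-D push action to a final object position.

    The dynamics are invented for illustration and are treated as a black box below.
    """
    ax, ay = action
    return [0.8 * ax + 0.1 * ax * ay, 0.8 * ay - 0.05 * ax * ax]


def loss(action: List[float], target: List[float]) -> float:
    """Squared distance between the simulated and the desired object position."""
    x, y = simulate_push(action)
    return (x - target[0]) ** 2 + (y - target[1]) ** 2


def finite_difference_grad(
    f: Callable[[List[float]], float], x: List[float], eps: float = 1e-5
) -> List[float]:
    """Central finite-difference estimate of the gradient of f at x."""
    grad = []
    for i in range(len(x)):
        up = list(x)
        down = list(x)
        up[i] += eps
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2.0 * eps))
    return grad


def fit_action(target: List[float], steps: int = 500, lr: float = 0.2) -> List[float]:
    """Gradient descent on the action, using only simulator evaluations."""
    action = [0.0, 0.0]
    for _ in range(steps):
        g = finite_difference_grad(lambda a: loss(a, target), action)
        action = [a - lr * gi for a, gi in zip(action, g)]
    return action


if __name__ == "__main__":
    target = [0.4, 0.2]
    best = fit_action(target)
    print("action:", best, "loss:", loss(best, target))
```

The point of the sketch is that the simulator never has to expose analytic derivatives: two extra evaluations per action dimension are enough to pass a gradient back to whatever produced the action, which in a learned pipeline could be a neural network.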

Code & Models

Repositories

baceituno/dlm
PyTorch · Official

Taxonomy

Topics: Robot Manipulation and Learning · Human Pose and Action Recognition · Hand Gesture Recognition Systems