# Combining Self-Supervised Learning and Imitation for Vision-Based Rope   Manipulation

**Authors:** Ashvin Nair, Dian Chen, Pulkit Agrawal, Phillip Isola, Pieter Abbeel,, Jitendra Malik, Sergey Levine

arXiv: 1703.02018 · 2017-03-07

## TL;DR

This paper introduces a robot system that learns to manipulate ropes by combining human demonstrations with self-supervised learning of pixel-level inverse dynamics from monocular images, enabling flexible shape control.

## Contribution

It presents a novel approach that integrates high-level human guidance with low-level self-supervised inverse models for deformable object manipulation.

## Key findings

- Robot successfully reproduces human-demonstrated rope shapes
- System operates using only monocular images and autonomous data collection
- Combines high-level plans with low-level inverse models for effective manipulation

## Abstract

Manipulation of deformable objects, such as ropes and cloth, is an important but challenging problem in robotics. We present a learning-based system where a robot takes as input a sequence of images of a human manipulating a rope from an initial to goal configuration, and outputs a sequence of actions that can reproduce the human demonstration, using only monocular images as input. To perform this task, the robot learns a pixel-level inverse dynamics model of rope manipulation directly from images in a self-supervised manner, using about 60K interactions with the rope collected autonomously by the robot. The human demonstration provides a high-level plan of what to do and the low-level inverse model is used to execute the plan. We show that by combining the high and low-level plans, the robot can successfully manipulate a rope into a variety of target shapes using only a sequence of human-provided images for direction.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.02018/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1703.02018/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/1703.02018/full.md

---
Source: https://tomesphere.com/paper/1703.02018