Augment-Connect-Explore: a Paradigm for Visual Action Planning with Data   Scarcity

Martina Lippi; Michael C. Welle; Petra Poklukar; Alessandro Marino and; Danica Kragic

arXiv:2203.13034·cs.RO·August 2, 2022

Augment-Connect-Explore: a Paradigm for Visual Action Planning with Data Scarcity

Martina Lippi, Michael C. Welle, Petra Poklukar, Alessandro Marino and, Danica Kragic

PDF

Open Access

TL;DR

This paper introduces the ACE paradigm to improve visual action planning under data scarcity by augmenting data, connecting latent states, and exploring the latent space, demonstrated on manipulation tasks with limited data.

Contribution

The paper presents a novel ACE framework that enhances the Latent Space Roadmap for visual action planning in low-data scenarios, combining data augmentation, connection creation, and targeted exploration.

Findings

01

Effective in simulated box stacking tasks

02

Successful in real-world deformable object folding

03

Improves planning accuracy with limited data

Abstract

Visual action planning particularly excels in applications where the state of the system cannot be computed explicitly, such as manipulation of deformable objects, as it enables planning directly from raw images. Even though the field has been significantly accelerated by deep learning techniques, a crucial requirement for their success is the availability of a large amount of data. In this work, we propose the Augment-Connect-Explore (ACE) paradigm to enable visual action planning in cases of data scarcity. We build upon the Latent Space Roadmap (LSR) framework which performs planning with a graph built in a low dimensional latent space. In particular, ACE is used to i) Augment the available training dataset by autonomously creating new pairs of datapoints, ii) create new unobserved Connections among representations of states in the latent graph, and iii) Explore new regions of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Human Pose and Action Recognition