Augment-Connect-Explore: a Paradigm for Visual Action Planning with Data Scarcity
Martina Lippi, Michael C. Welle, Petra Poklukar, Alessandro Marino and, Danica Kragic

TL;DR
This paper introduces the ACE paradigm to improve visual action planning under data scarcity by augmenting data, connecting latent states, and exploring the latent space, demonstrated on manipulation tasks with limited data.
Contribution
The paper presents a novel ACE framework that enhances the Latent Space Roadmap for visual action planning in low-data scenarios, combining data augmentation, connection creation, and targeted exploration.
Findings
Effective in simulated box stacking tasks
Successful in real-world deformable object folding
Improves planning accuracy with limited data
Abstract
Visual action planning particularly excels in applications where the state of the system cannot be computed explicitly, such as manipulation of deformable objects, as it enables planning directly from raw images. Even though the field has been significantly accelerated by deep learning techniques, a crucial requirement for their success is the availability of a large amount of data. In this work, we propose the Augment-Connect-Explore (ACE) paradigm to enable visual action planning in cases of data scarcity. We build upon the Latent Space Roadmap (LSR) framework which performs planning with a graph built in a low dimensional latent space. In particular, ACE is used to i) Augment the available training dataset by autonomously creating new pairs of datapoints, ii) create new unobserved Connections among representations of states in the latent graph, and iii) Explore new regions of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Human Pose and Action Recognition
