DoughNet: A Visual Predictive Model for Topological Manipulation of   Deformable Objects

Dominik Bauer; Zhenjia Xu; Shuran Song

arXiv:2404.12524·cs.CV·April 22, 2024

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

PDF

Open Access

TL;DR

DoughNet is a Transformer-based model that predicts topological and geometrical changes in deformable objects like dough, enabling better planning of robotic manipulation involving splitting and merging.

Contribution

Introducing DoughNet, a novel autoencoder and autoregressive set prediction model for topological manipulation of deformable objects in latent space.

Findings

01

Outperforms existing deformation prediction methods

02

Accurately predicts topological changes in simulated environments

03

Effective in real-world robotic manipulation tasks

Abstract

Manipulation of elastoplastic objects like dough often involves topological changes such as splitting and merging. The ability to accurately predict these topological changes that a specific action might incur is critical for planning interactions with elastoplastic objects. We present DoughNet, a Transformer-based architecture for handling these challenges, consisting of two components. First, a denoising autoencoder represents deformable objects of varying topology as sets of latent codes. Second, a visual predictive model performs autoregressive set prediction to determine long-horizon geometrical deformation and topological changes purely in latent space. Given a partial initial state and desired manipulation trajectories, it infers all resulting object geometries and topologies at each step. DoughNet thereby allows to plan robotic manipulation; selecting a suited tool, its pose and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction · Manufacturing Process and Optimization

MethodsSparse Evolutionary Training · Denoising Autoencoder