Flexible Networks for Learning Physical Dynamics of Deformable Objects

Jinhyung Park; DoHae Lee; In-Kwon Lee

arXiv:2112.03728·cs.CV·January 14, 2022·1 cites

Flexible Networks for Learning Physical Dynamics of Deformable Objects

Jinhyung Park, DoHae Lee, In-Kwon Lee

PDF

Open Access 1 Repo

TL;DR

This paper introduces TP-Net, a permutation-invariant model that predicts the future states of deformable objects from unordered point sets, achieving state-of-the-art accuracy and real-time performance.

Contribution

The paper presents a novel permutation-invariant neural network architecture for modeling deformable object dynamics directly from unordered point sets.

Findings

01

Achieves state-of-the-art accuracy in synthetic and real-world datasets.

02

Operates in real-time, enabling practical applications.

03

Demonstrates robustness to unordered input data.

Abstract

Learning the physical dynamics of deformable objects with particle-based representation has been the objective of many computational models in machine learning. While several state-of-the-art models have achieved this objective in simulated environments, most existing models impose a precondition, such that the input is a sequence of ordered point sets. That is, the order of the points in each point set must be the same across the entire input sequence. This precondition restrains the model from generalizing to real-world data, which is considered to be a sequence of unordered point sets. In this paper, we propose a model named time-wise PointNet (TP-Net) that solves this problem by directly consuming a sequence of unordered point sets to infer the future state of a deformable object with particle-based representation. Our model consists of a shared feature extractor that extracts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jinhyung-park-info/TP-Net
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings