Flexible Networks for Learning Physical Dynamics of Deformable Objects
Jinhyung Park, DoHae Lee, In-Kwon Lee

TL;DR
This paper introduces TP-Net, a permutation-invariant model that predicts the future states of deformable objects from unordered point sets, achieving state-of-the-art accuracy and real-time performance.
Contribution
The paper presents a novel permutation-invariant neural network architecture for modeling deformable object dynamics directly from unordered point sets.
Findings
Achieves state-of-the-art accuracy in synthetic and real-world datasets.
Operates in real-time, enabling practical applications.
Demonstrates robustness to unordered input data.
Abstract
Learning the physical dynamics of deformable objects with particle-based representation has been the objective of many computational models in machine learning. While several state-of-the-art models have achieved this objective in simulated environments, most existing models impose a precondition, such that the input is a sequence of ordered point sets. That is, the order of the points in each point set must be the same across the entire input sequence. This precondition restrains the model from generalizing to real-world data, which is considered to be a sequence of unordered point sets. In this paper, we propose a model named time-wise PointNet (TP-Net) that solves this problem by directly consuming a sequence of unordered point sets to infer the future state of a deformable object with particle-based representation. Our model consists of a shared feature extractor that extracts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
