Distributional Depth-Based Estimation of Object Articulation Models
Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, Scott Niekum

TL;DR
This paper introduces DUST-net, a deep learning method that estimates distributions over object articulation models from depth images, capturing uncertainties and outperforming existing point estimate methods across categories.
Contribution
It presents a novel distribution representation for articulation models using screw theory and deep learning, enabling category-independent estimation with uncertainty quantification.
Findings
DUST-net outperforms state-of-the-art methods in accuracy.
It effectively captures model uncertainty from noisy inputs.
The approach generalizes across different articulation categories.
Abstract
We propose a method that efficiently learns distributions over articulation model parameters directly from depth images without the need to know articulation model categories a priori. By contrast, existing methods that learn articulation models from raw observations typically only predict point estimates of the model parameters, which are insufficient to guarantee the safe manipulation of articulated objects. Our core contributions include a novel representation for distributions over rigid body transformations and articulation model parameters based on screw theory, von Mises-Fisher distributions, and Stiefel manifolds. Combining these concepts allows for an efficient, mathematically sound representation that implicitly satisfies the constraints that rigid body transformations and articulations must adhere to. Leveraging this representation, we introduce a novel deep learning based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Multimodal Machine Learning Applications
