TL;DR
This paper introduces neural networks that predict and generate the functionality of 3D objects by analyzing interaction contexts, enabling understanding of object use without surrounding environment information.
Contribution
It develops a novel combination of predictive, generative, and segmentation neural networks to infer and visualize object functionalities from isolated 3D objects.
Findings
fSIM-NET accurately predicts object functionalities from isolated objects.
iGEN-NET successfully synthesizes interaction contexts demonstrating object functions.
iSEG-NET effectively segments interacting objects by interaction type.
Abstract
Humans can predict the functionality of an object even without any surroundings, since their knowledge and experience would allow them to "hallucinate" the interaction or usage scenarios involving the object. We develop predictive and generative deep convolutional neural networks to replicate this feat. Specifically, our work focuses on functionalities of man-made 3D objects characterized by human-object or object-object interactions. Our networks are trained on a database of scene contexts, called interaction contexts, each consisting of a central object and one or more surrounding objects, that represent object functionalities. Given a 3D object in isolation, our functional similarity network (fSIM-NET), a variation of the triplet network, is trained to predict the functionality of the object by inferring functionality-revealing interaction contexts. fSIM-NET is complemented by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
