Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions
Tadashi Matsuo, Nobutaka Shimada

TL;DR
This paper introduces an unsupervised approach to model hand-object interactions by constructing a latent descriptor space from unlabeled data, enabling inference of interaction states and object parts related to handling.
Contribution
It proposes a novel interaction descriptor space derived from unlabeled data, facilitating the understanding and inference of hand-object interactions without supervision.
Findings
Clusters correspond to interaction types
Model infers interaction states from object images
Identifies object parts related to interactions
Abstract
Appearance-based generic object recognition is a challenging problem because all possible appearances of objects cannot be registered, especially as new objects are produced every day. Function of objects, however, has a comparatively small number of prototypes. Therefore, function-based classification of new objects could be a valuable tool for generic object recognition. Object functions are closely related to hand-object interactions during handling of a functional object; i.e., how the hand approaches the object, which parts of the object and contact the hand, and the shape of the hand during interaction. Hand-object interactions are helpful for modeling object functions. However, it is difficult to assign discrete labels to interactions because an object shape and grasping hand-postures intrinsically have continuous variations. To describe these interactions, we propose the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
