Construction of Latent Descriptor Space and Inference Model of   Hand-Object Interactions

Tadashi Matsuo; Nobutaka Shimada

arXiv:1709.03739·cs.CV·September 13, 2017

Construction of Latent Descriptor Space and Inference Model of Hand-Object Interactions

Tadashi Matsuo, Nobutaka Shimada

PDF

TL;DR

This paper introduces an unsupervised approach to model hand-object interactions by constructing a latent descriptor space from unlabeled data, enabling inference of interaction states and object parts related to handling.

Contribution

It proposes a novel interaction descriptor space derived from unlabeled data, facilitating the understanding and inference of hand-object interactions without supervision.

Findings

01

Clusters correspond to interaction types

02

Model infers interaction states from object images

03

Identifies object parts related to interactions

Abstract

Appearance-based generic object recognition is a challenging problem because all possible appearances of objects cannot be registered, especially as new objects are produced every day. Function of objects, however, has a comparatively small number of prototypes. Therefore, function-based classification of new objects could be a valuable tool for generic object recognition. Object functions are closely related to hand-object interactions during handling of a functional object; i.e., how the hand approaches the object, which parts of the object and contact the hand, and the shape of the hand during interaction. Hand-object interactions are helpful for modeling object functions. However, it is difficult to assign discrete labels to interactions because an object shape and grasping hand-postures intrinsically have continuous variations. To describe these interactions, we propose the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.