OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object   Interaction

Lixin Yang; Kailin Li; Xinyu Zhan; Fei Wu; Anran Xu; Liu Liu; Cewu Lu

arXiv:2203.15709·cs.CV·March 30, 2022

OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction

Lixin Yang, Kailin Li, Xinyu Zhan, Fei Wu, Anran Xu, Liu Liu, Cewu Lu

PDF

Open Access 1 Repo 1 Datasets

TL;DR

OakInk is a comprehensive large-scale knowledge repository that captures human-object interactions and affordances, enabling improved understanding and generation of hand-object interactions for robotics and AI applications.

Contribution

The paper introduces OakInk, the first large-scale, multi-modal knowledge base combining object affordances and human interactions, with a novel transfer method Tink for virtual interaction synthesis.

Findings

01

OakInk contains 50,000 annotated hand-object interactions.

02

Benchmark results demonstrate improved pose estimation and grasp generation.

03

Practical applications include intent-based interaction and handover generation.

Abstract

Learning how humans manipulate objects requires machines to acquire knowledge from two perspectives: one for understanding object affordances and the other for learning human's interactions based on the affordances. Even though these two knowledge bases are crucial, we find that current databases lack a comprehensive awareness of them. In this work, we propose a multi-modal and rich-annotated knowledge repository, OakInk, for visual and cognitive understanding of hand-object interactions. We start to collect 1,800 common household objects and annotate their affordances to construct the first knowledge base: Oak. Given the affordance, we record rich human interactions with 100 selected objects in Oak. Finally, we transfer the interactions on the 100 recorded objects to their virtual counterparts through a novel method: Tink. The recorded and transferred hand-object interactions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lixiny/oakink
pytorchOfficial

Datasets

oakink/OakInk-v1
dataset· 281 dl
281 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition