TL;DR
This paper introduces a system that captures and annotates human activities in virtual reality, translating them into structured, robot-understandable models by combining physical simulation, gaze tracking, and symbolic event parsing.
Contribution
The novel system integrates VR, physics simulation, gaze tracking, and symbolic parsing to automatically generate semantic models of manipulation activities from human demonstrations.
Findings
Accurate mapping of human movements to virtual environments.
Effective offline and online activity annotation.
Generation of robot-understandable activity models.
Abstract
In this paper we present a system capable of collecting and annotating, human performed, robot understandable, everyday activities from virtual environments. The human movements are mapped in the simulated world using off-the-shelf virtual reality devices with full body, and eye tracking capabilities. All the interactions in the virtual world are physically simulated, thus movements and their effects are closely relatable to the real world. During the activity execution, a subsymbolic data logger is recording the environment and the human gaze on a per-frame basis, enabling offline scene reproduction and replays. Coupled with the physics engine, online monitors (symbolic data loggers) are parsing (using various grammars) and recording events, actions, and their effects in the simulated world.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
