HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
Xin Wang, Taein Kwon, Mahdi Rad, Bowen Pan, Ishani Chakraborty, Sean, Andrist, Dan Bohus, Ashley Feniello, Bugra Tekin, Felipe Vieira Frujeri, Neel, Joshi, Marc Pollefeys

TL;DR
HoloAssist is a comprehensive egocentric dataset capturing human interactions during physical tasks, designed to advance AI assistants' ability to perceive, reason, and collaborate with humans in real-world scenarios.
Contribution
The paper introduces HoloAssist, a large-scale dataset with multi-modal data and annotations for developing interactive AI assistants capable of real-world collaboration.
Findings
Insights into human correction and intervention behaviors
Benchmarks for mistake detection and intervention prediction
Analysis of task grounding and hand forecasting
Abstract
Building an interactive AI assistant that can perceive, reason, and collaborate with humans in the real world has been a long-standing pursuit in the AI community. This work is part of a broader research effort to develop intelligent agents that can interactively guide humans through performing tasks in the physical world. As a first step in this direction, we introduce HoloAssist, a large-scale egocentric human interaction dataset, where two people collaboratively complete physical manipulation tasks. The task performer executes the task while wearing a mixed-reality headset that captures seven synchronized data streams. The task instructor watches the performer's egocentric video in real time and guides them verbally. By augmenting the data with action and conversational annotations and observing the rich behaviors of various participants, we present key insights into how human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Social Robot Interaction and HRI · Virtual Reality Applications and Impacts
MethodsFast Attention Via Positive Orthogonal Random Features · Performer
