BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments

Sanjana Srivastava; Chengshu Li; Michael Lingelbach; Roberto Martín-Martín; Fei Xia; Kent Vainio; Zheng Lian; Cem Gokmen; Shyamal Buch; C. Karen Liu; Silvio Savarese; Hyowon Gweon; Jiajun Wu; Li Fei-Fei

arXiv:2108.03332 · cs.RO · August 10, 2021 · 36 cites

Open Access

TL;DR

BEHAVIOR is a comprehensive benchmark for embodied AI involving 100 realistic household activities in simulation, designed to evaluate and advance AI agents' ability to perform complex, diverse, and ecologically valid tasks.

Contribution

It introduces an object-centric, predicate logic-based activity description language, a specification of the simulator-agnostic features an environment must provide, and new metrics, addressing the three key challenges in benchmarking household activities: definition, instantiation, and evaluation.
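
To make the idea of an object-centric, predicate logic-based description concrete, here is a minimal Python sketch. It is not the paper's language or syntax: the predicate names (`ontop`, `inside`, `dusty`), the object names, and the helpers `holds`, `all_of`, and `negate` are all hypothetical, chosen only to show how initial and goal conditions can be written as logical statements about objects and then checked against a world state.

```python
from typing import Callable, FrozenSet, Tuple

# A ground atom is a predicate name followed by object names,
# e.g. ("ontop", "plate_1", "table_1"). All names are illustrative.
Atom = Tuple[str, ...]
State = FrozenSet[Atom]
Condition = Callable[[State], bool]


def holds(atom: Atom) -> Condition:
    """Condition that is true when the atom is present in the state."""
    return lambda state: atom in state


def negate(cond: Condition) -> Condition:
    """Logical NOT of a condition."""
    return lambda state: not cond(state)


def all_of(*conds: Condition) -> Condition:
    """Logical AND over several conditions."""
    return lambda state: all(c(state) for c in conds)


# Hypothetical activity: put away a plate once it is clean.
initial_state: State = frozenset({
    ("ontop", "plate_1", "table_1"),
    ("dusty", "plate_1"),
})

goal: Condition = all_of(
    holds(("inside", "plate_1", "cabinet_1")),
    negate(holds(("dusty", "plate_1"))),
)

# A state an agent might reach after completing the activity.
final_state: State = frozenset({("inside", "plate_1", "cabinet_1")})

print(goal(initial_state))  # False
print(goal(final_state))    # True
```

Because the goal is stated over objects and their relations rather than over a specific scene layout, the same condition could be checked in any scene that contains a plate and a cabinet, which is the property that makes generating diverse instances of one activity possible.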

Findings

01. State-of-the-art AI solutions struggle with BEHAVIOR's complexity.
02. The benchmark includes 500 human VR demonstrations as ground truth.
03. BEHAVIOR is publicly available for research use.

Abstract

We introduce BEHAVIOR, a benchmark for embodied AI with 100 activities in simulation, spanning a range of everyday household chores such as cleaning, maintenance, and food preparation. These activities are designed to be realistic, diverse, and complex, aiming to reproduce the challenges that agents must face in the real world. Building such a benchmark poses three fundamental difficulties for each activity: definition (it can differ by time, place, or person), instantiation in a simulator, and evaluation. BEHAVIOR addresses these with three innovations. First, we propose an object-centric, predicate logic-based description language for expressing an activity's initial and goal conditions, enabling generation of diverse instances for any activity. Second, we identify the simulator-agnostic features required by an underlying environment to support BEHAVIOR, and demonstrate its…

Taxonomy

Topics: Reinforcement Learning in Robotics · Human Pose and Action Recognition · Human Motion and Animation