{\lambda}: A Benchmark for Data-Efficiency in Long-Horizon Indoor Mobile Manipulation Robotics

Ahmed Jaafar; Shreyas Sundara Raman; Sudarshan Harithas; Yichen Wei; Sofia Juliani; Anneke Wernerfelt; Benedict Quartey; Ifrah Idrees; Jason Xinyu Liu; Stefanie Tellex

arXiv:2412.05313·cs.RO·August 5, 2025

{\lambda}: A Benchmark for Data-Efficiency in Long-Horizon Indoor Mobile Manipulation Robotics

Ahmed Jaafar, Shreyas Sundara Raman, Sudarshan Harithas, Yichen Wei, Sofia Juliani, Anneke Wernerfelt, Benedict Quartey, Ifrah Idrees, Jason Xinyu Liu, Stefanie Tellex

PDF

Open Access

TL;DR

This paper introduces the LAMBDA benchmark to evaluate data efficiency in long-horizon indoor mobile manipulation tasks, emphasizing realistic, human-collected data and comparing learning and neuro-symbolic methods.

Contribution

The paper presents a new benchmark with a realistic dataset for evaluating data efficiency in long-horizon manipulation, and compares learning and neuro-symbolic approaches.

Findings

01

Neuro-symbolic methods outperform pure learning methods in success rates.

02

Pretrained learning models still struggle with data efficiency.

03

The benchmark enables realistic evaluation of manipulation strategies.

Abstract

Learning to execute long-horizon mobile manipulation tasks is crucial for advancing robotics in household and workplace settings. However, current approaches are typically data-inefficient, underscoring the need for improved models that require realistically sized benchmarks to evaluate their efficiency. To address this, we introduce the LAMBDA ({\lambda}) benchmark-Long-horizon Actions for Mobile-manipulation Benchmarking of Directed Activities-which evaluates the data efficiency of models on language-conditioned, long-horizon, multi-room, multi-floor, pick-and-place tasks using a dataset of manageable size, more feasible for collection. Our benchmark includes 571 human-collected demonstrations that provide realism and diversity in simulated and real-world settings. Unlike planner-generated data, these trajectories offer natural variability and replay-verifiability, ensuring robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Robotics and Automated Systems