Ask4Help: Learning to Leverage an Expert for Embodied Tasks
Kunal Pratap Singh, Luca Weihs, Alvaro Herrasti, Jonghyun Choi, Aniruddha Kembhavi, Roozbeh Mottaghi

TL;DR
Ask4Help enables embodied AI agents to request expert assistance, significantly improving task success rates with minimal help, by training policies that balance performance and help cost without altering the original agent.
Contribution
The paper introduces Ask4Help, a novel approach allowing agents to learn when and how to request expert help, improving performance efficiently without modifying existing models.
Findings
Object navigation success increases from 52% to 86% with 13% help.
Room rearrangement success rises from 7% to 90.4% with 39% help.
Human trials confirm practical effectiveness of Ask4Help.
Abstract
Embodied AI agents continue to become more capable every year with the advent of new models, environments, and benchmarks, but are still far from being performant and reliable enough to be deployed in real, user-facing applications. In this paper, we ask: can we bridge this gap by enabling agents to ask for assistance from an expert such as a human being? To this end, we propose the Ask4Help policy, which augments agents with the ability to request, and then use, expert assistance. Ask4Help policies can be efficiently trained without modifying the original agent's parameters and learn a desirable trade-off between task performance and the amount of requested help, thereby reducing the cost of querying the expert. We evaluate Ask4Help on two different tasks, object goal navigation and room rearrangement, and see substantial improvements in performance using minimal help. On object…
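The trade-off described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the names `HelpStats`, `choose_action`, and `episode_reward`, the fixed `ask_probability` standing in for a learned policy, and the reward constants are all assumptions made for illustration. It shows only the general idea of a wrapper that leaves the underlying agent untouched, substitutes the expert's action when help is requested, and scores an episode by task success minus a per-step help cost.

```python
import random
from dataclasses import dataclass


@dataclass
class HelpStats:
    """Counts how often help was requested during one episode."""
    steps: int = 0
    help_steps: int = 0

    @property
    def help_fraction(self) -> float:
        # Fraction of steps on which the expert acted (0.0 for an empty episode).
        return self.help_steps / self.steps if self.steps else 0.0


def choose_action(agent_action, expert_action, ask_probability, rng, stats):
    """Pick the expert's action when help is requested, else the frozen agent's.

    `ask_probability` is a hypothetical stand-in for the output of a learned
    ask-for-help policy; the underlying agent is never modified.
    """
    stats.steps += 1
    if rng.random() < ask_probability:
        stats.help_steps += 1
        return expert_action
    return agent_action


def episode_reward(success, stats, success_reward=10.0, help_cost=0.05):
    """Assumed reward shape: task success minus a cost for each help step."""
    return (success_reward if success else 0.0) - help_cost * stats.help_steps


if __name__ == "__main__":
    rng = random.Random(0)
    stats = HelpStats()
    for _ in range(100):
        choose_action("agent_move", "expert_move", 0.13, rng, stats)
    print(f"help fraction: {stats.help_fraction:.2f}")
    print(f"reward if successful: {episode_reward(True, stats):.2f}")
```

Raising `help_cost` in this sketch makes every request more expensive, which is the kind of pressure that would push a learned policy toward asking less often.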
Taxonomy
Topics: Multimodal Machine Learning Applications · Context-Aware Activity Recognition Systems · Mobile Crowdsensing and Crowdsourcing
