Help, Anna! Visual Navigation with Natural Multimodal Assistance via   Retrospective Curiosity-Encouraging Imitation Learning

Khanh Nguyen; Hal Daum\'e III

arXiv:1909.01871·cs.HC·November 25, 2019·5 cites

Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation Learning

Khanh Nguyen, Hal Daum\'e III

PDF

Open Access 1 Repo

TL;DR

This paper introduces 'Help, Anna!', a simulator where an agent learns to request and interpret natural language and visual assistance to improve object-finding tasks, demonstrating better help-seeking behavior and success rates.

Contribution

The paper presents a hierarchical, memory-augmented neural agent trained with imitation learning to effectively seek help in complex visual navigation tasks.

Findings

01

Agent asks for help more effectively than baselines

02

Achieves higher success rates in seen and unseen environments

03

Demonstrates the effectiveness of retrospective curiosity in imitation learning

Abstract

Mobile agents that can leverage help from humans can potentially accomplish more complex tasks than they could entirely on their own. We develop "Help, Anna!" (HANNA), an interactive photo-realistic simulator in which an agent fulfills object-finding tasks by requesting and interpreting natural language-and-vision assistance. An agent solving tasks in a HANNA environment can leverage simulated human assistants, called ANNA (Automatic Natural Navigation Assistants), which, upon request, provide natural language and visual instructions to direct the agent towards the goals. To address the HANNA problem, we develop a memory-augmented neural agent that hierarchically models multiple levels of decision-making, and an imitation learning algorithm that teaches the agent to avoid repeating past mistakes while simultaneously predicting its own chances of making future progress. Empirically, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

khanhptnk/hanna
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Artificial Intelligence in Games