Building Intelligent Autonomous Navigation Agents

Devendra Singh Chaplot

arXiv:2106.13415·cs.CV·June 28, 2021

Building Intelligent Autonomous Navigation Agents

Devendra Singh Chaplot

PDF

Open Access

TL;DR

This paper advances autonomous navigation by combining classical and end-to-end learning methods, enabling agents to perform complex, long-term tasks involving perception, reasoning, and planning in physical environments.

Contribution

It introduces a hybrid approach using modular learning and explicit map representations to improve long-term semantic navigation capabilities.

Findings

01

Achieved state-of-the-art results on various navigation tasks.

02

Demonstrated effective long-term spatial and semantic understanding.

03

Successfully integrated classical and deep learning methods for navigation.

Abstract

Breakthroughs in machine learning in the last decade have led to `digital intelligence', i.e. machine learning models capable of learning from vast amounts of labeled data to perform several digital tasks such as speech recognition, face recognition, machine translation and so on. The goal of this thesis is to make progress towards designing algorithms capable of `physical intelligence', i.e. building intelligent autonomous navigation agents capable of learning to perform complex navigation tasks in the physical world involving visual perception, natural language understanding, reasoning, planning, and sequential decision making. Despite several advances in classical navigation methods in the last few decades, current navigation agents struggle at long-term semantic navigation tasks. In the first part of the thesis, we discuss our work on short-term navigation using end-to-end…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning