Building Intelligent Autonomous Navigation Agents
Devendra Singh Chaplot

TL;DR
This paper advances autonomous navigation by combining classical and end-to-end learning methods, enabling agents to perform complex, long-term tasks involving perception, reasoning, and planning in physical environments.
Contribution
It introduces a hybrid approach using modular learning and explicit map representations to improve long-term semantic navigation capabilities.
Findings
Achieved state-of-the-art results on various navigation tasks.
Demonstrated effective long-term spatial and semantic understanding.
Successfully integrated classical and deep learning methods for navigation.
Abstract
Breakthroughs in machine learning in the last decade have led to `digital intelligence', i.e. machine learning models capable of learning from vast amounts of labeled data to perform several digital tasks such as speech recognition, face recognition, machine translation and so on. The goal of this thesis is to make progress towards designing algorithms capable of `physical intelligence', i.e. building intelligent autonomous navigation agents capable of learning to perform complex navigation tasks in the physical world involving visual perception, natural language understanding, reasoning, planning, and sequential decision making. Despite several advances in classical navigation methods in the last few decades, current navigation agents struggle at long-term semantic navigation tasks. In the first part of the thesis, we discuss our work on short-term navigation using end-to-end…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning
