Agents that Listen: High-Throughput Reinforcement Learning with Multiple   Sensory Systems

Shashank Hegde; Anssi Kanervisto; Aleksei Petrenko

arXiv:2107.02195·cs.LG·July 7, 2021

Agents that Listen: High-Throughput Reinforcement Learning with Multiple Sensory Systems

Shashank Hegde, Anssi Kanervisto, Aleksei Petrenko

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new simulation environment with audio inputs for reinforcement learning, demonstrating that agents can learn to recognize sounds, follow instructions, and outperform vision-only agents in complex tasks like Doom.

Contribution

It presents a novel audio-visual reinforcement learning environment and evaluates different models, advancing multi-sensory perception research in artificial agents.

Findings

01

Agents can recognize sounds and follow natural language instructions.

02

Audio-visual agents outperform vision-only agents in Doom.

03

New simulation environment facilitates multi-sensory RL research.

Abstract

Humans and other intelligent animals evolved highly sophisticated perception systems that combine multiple sensory modalities. On the other hand, state-of-the-art artificial agents rely mostly on visual inputs or structured low-dimensional observations provided by instrumented environments. Learning to act based on combined visual and auditory inputs is still a new topic of research that has not been explored beyond simple scenarios. To facilitate progress in this area we introduce a new version of VizDoom simulator to create a highly efficient learning environment that provides raw audio observations. We study the performance of different model architectures in a series of tasks that require the agent to recognize sounds and execute instructions given in natural language. Finally, we train our agent to play the full game of Doom and find that it can consistently defeat a traditional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hegde95/Agents_that_Listen
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning