Agents that Listen: High-Throughput Reinforcement Learning with Multiple Sensory Systems
Shashank Hegde, Anssi Kanervisto, Aleksei Petrenko

TL;DR
This paper introduces a new simulation environment with audio inputs for reinforcement learning, demonstrating that agents can learn to recognize sounds, follow instructions, and outperform vision-only agents in complex tasks like Doom.
Contribution
It presents a novel audio-visual reinforcement learning environment and evaluates different models, advancing multi-sensory perception research in artificial agents.
Findings
Agents can recognize sounds and follow natural language instructions.
Audio-visual agents outperform vision-only agents in Doom.
New simulation environment facilitates multi-sensory RL research.
Abstract
Humans and other intelligent animals evolved highly sophisticated perception systems that combine multiple sensory modalities. On the other hand, state-of-the-art artificial agents rely mostly on visual inputs or structured low-dimensional observations provided by instrumented environments. Learning to act based on combined visual and auditory inputs is still a new topic of research that has not been explored beyond simple scenarios. To facilitate progress in this area we introduce a new version of VizDoom simulator to create a highly efficient learning environment that provides raw audio observations. We study the performance of different model architectures in a series of tasks that require the agent to recognize sounds and execute instructions given in natural language. Finally, we train our agent to play the full game of Doom and find that it can consistently defeat a traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
