Closing the loop between neural network simulators and the OpenAI Gym
Jakob Jordan, Philipp Weidel, Abigail Morrison

TL;DR
This paper introduces a new toolchain that connects neural network simulators with OpenAI Gym, enabling standardized benchmarking of biologically plausible reinforcement learning models across different environments.
Contribution
It presents a novel integration method that allows neural network simulators to utilize OpenAI Gym environments for easier comparison and reproducibility of reinforcement learning research.
Findings
Successfully implemented a neuronal actor-critic architecture in NEST
Trained the model on multiple OpenAI Gym environments
Demonstrated improved reproducibility and benchmarking capabilities
Abstract
Since the enormous breakthroughs in machine learning over the last decade, functional neural network models are of growing interest for many researchers in the field of computational neuroscience. One major branch of research is concerned with biologically plausible implementations of reinforcement learning, with a variety of different models developed over the recent years. However, most studies in this area are conducted with custom simulation scripts and manually implemented tasks. This makes it hard for other researchers to reproduce and build upon previous work and nearly impossible to compare the performance of different learning architectures. In this work, we present a novel approach to solve this problem, connecting benchmark tools from the field of machine learning and state-of-the-art neural network simulators from computational neuroscience. This toolchain enables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Neural Networks and Reservoir Computing
