Biological Neurons Compete with Deep Reinforcement Learning in Sample Efficiency in a Simulated Gameworld
Moein Khajehnejad, Forough Habibollahi, Aswin Paul, Adeel Razi, Brett, J. Kagan

TL;DR
Biological neural networks demonstrate higher sample efficiency and faster learning than deep reinforcement learning algorithms in a simulated Pong game, especially with limited data.
Contribution
This study provides a direct comparison between biological neural systems and deep RL algorithms, highlighting biological systems' superior sample efficiency in a simplified task.
Findings
Biological neurons outperform deep RL in sample-limited scenarios.
Biological systems learn faster across various input types.
Biological neural networks show higher overall learning efficiency.
Abstract
How do biological systems and machine learning algorithms compare in the number of samples required to show significant improvements in completing a task? We compared the learning efficiency of in vitro biological neural networks to the state-of-the-art deep reinforcement learning (RL) algorithms in a simplified simulation of the game `Pong'. Using DishBrain, a system that embodies in vitro neural networks with in silico computation using a high-density multi-electrode array, we contrasted the learning rate and the performance of these biological systems against time-matched learning from three state-of-the-art deep RL algorithms (i.e., DQN, A2C, and PPO) in the same game environment. This allowed a meaningful comparison between biological neural systems and deep RL. We find that when samples are limited to a real-world time course, even these very simple biological cultures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Neural Networks and Applications
MethodsDense Connections · A2C · Q-Learning · Convolution · Deep Q-Network
