Efficient Reinforcement Learning On Passive RRAM Crossbar Array
Arjun Tyagi, Shubham Sahay

TL;DR
This paper presents a novel Monte Carlo reinforcement learning implementation on passive RRAM crossbar arrays, achieving significant improvements in area efficiency and robustness against device variations and endurance issues.
Contribution
It introduces the first algorithm-hardware co-design for Monte Carlo RL on passive RRAM arrays, overcoming endurance limitations and outperforming prior digital and active RRAM implementations.
Findings
Outperforms prior implementations by over five orders of magnitude in area.
Demonstrates robustness against RRAM variations and endurance failures.
Achieves energy-efficient RL suitable for embedded neuromorphic systems.
Abstract
The unprecedented growth in the field of machine learning has led to the development of deep neuromorphic networks trained on labelled dataset with capability to mimic or even exceed human capabilities. However, for applications involving continuous decision making in unknown environments, such as rovers for space exploration, robots, unmanned aerial vehicles, etc., explicit supervision and generation of labelled data set is extremely difficult and expensive. Reinforcement learning (RL) allows the agents to take decisions without any (human/external) supervision or training on labelled dataset. However, the conventional implementations of RL on advanced digital CPUs/GPUs incur a significantly large power dissipation owing to their inherent von-Neumann architecture. Although crossbar arrays of emerging non-volatile memories such as resistive (R)RAMs with their innate capability to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Machine Learning and ELM · Ferroelectric and Negative Capacitance Devices
