Transfer and Online Reinforcement Learning in STT-MRAM Based Embedded   Systems for Autonomous Drones

Insik Yoon; Aqeel Anwar; Titash Rakshit; Arijit Raychowdhury

arXiv:1905.06314·cs.OH·May 16, 2019·1 cites

Transfer and Online Reinforcement Learning in STT-MRAM Based Embedded Systems for Autonomous Drones

Insik Yoon, Aqeel Anwar, Titash Rakshit, Arijit Raychowdhury

PDF

Open Access

TL;DR

This paper proposes a hardware-software co-design for autonomous drones that enables real-time reinforcement learning using a hybrid memory system, significantly reducing energy consumption while maintaining performance.

Contribution

It introduces a transfer learning and reinforcement learning framework tailored for NVM-based embedded systems in drones, optimizing memory usage and energy efficiency.

Findings

01

Achieved 83.4% lower energy per image frame.

02

Enabled real-time RL updates on SRAM for improved responsiveness.

03

Maintained comparable performance to end-to-end RL methods.

Abstract

In this paper we present an algorithm-hardware codesign for camera-based autonomous flight in small drones. We show that the large write-latency and write-energy for nonvolatile memory (NVM) based embedded systems makes them unsuitable for real-time reinforcement learning (RL). We address this by performing transfer learning (TL) on metaenvironments and RL on the last few layers of a deep convolutional network. While the NVM stores the meta-model from TL, an on-die SRAM stores the weights of the last few layers. Thus all the real-time updates via RL are carried out on the SRAM arrays. This provides us with a practical platform with comparable performance as end-to-end RL and 83.4% lower energy per image frame

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Memory and Neural Computing