FPGA Architecture for Deep Learning and its application to Planetary Robotics
Pranay Gankidi, Jekan Thangavelautham

TL;DR
This paper develops a FPGA-based hardware accelerator for Q-learning with neural networks, enabling faster and more efficient autonomous learning for planetary rovers within space hardware constraints.
Contribution
It introduces a novel FPGA architecture for Q-learning with neural networks, demonstrating significant speedup and efficiency improvements over traditional CPUs for space applications.
Findings
Up to 43-fold speedup on Virtex 7 FPGA compared to Intel i5 CPU.
Effective implementation of single neuron and multilayer perceptron Q-learning.
Simulated architecture shows promising performance and power efficiency.
Abstract
Autonomous control systems onboard planetary rovers and spacecraft benefit from having cognitive capabilities like learning so that they can adapt to unexpected situations in-situ. Q-learning is a form of reinforcement learning and it has been efficient in solving certain class of learning problems. However, embedded systems onboard planetary rovers and spacecraft rarely implement learning algorithms due to the constraints faced in the field, like processing power, chip size, convergence rate and costs due to the need for radiation hardening. These challenges present a compelling need for a portable, low-power, area efficient hardware accelerator to make learning algorithms practical onboard space hardware. This paper presents a FPGA implementation of Q-learning with Artificial Neural Networks (ANN). This method matches the massive parallelism inherent in neural network software with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Q-Learning
