Energy-Efficient Deep Reinforcement Learning with Spiking Transformers
Mohammad Irfan Uddin, Nishad Tasnim, Md Omor Faruk, Zejian Zhou

TL;DR
This paper introduces a Spike-Transformer Reinforcement Learning algorithm that combines energy-efficient spiking neural networks with Transformer-like decision-making, achieving high performance with lower energy consumption in complex tasks.
Contribution
The paper presents a novel SNN-based Transformer architecture for reinforcement learning that improves energy efficiency while maintaining high policy performance.
Findings
Significantly improved policy performance over traditional Transformers.
Enhanced energy efficiency demonstrated in numerical experiments.
Effective processing of spatio-temporal patterns over multiple time steps.
Abstract
Agent-based Transformers have been widely adopted in recent reinforcement learning advances due to their demonstrated ability to solve complex tasks. However, the high computational complexity of Transformers often results in significant energy consumption, limiting their deployment in real-world autonomous systems. Spiking neural networks (SNNs), with their biologically inspired structure, offer an energy-efficient alternative for machine learning. In this paper, a novel Spike-Transformer Reinforcement Learning (STRL) algorithm that combines the energy efficiency of SNNs with the powerful decision-making capabilities of reinforcement learning is developed. Specifically, an SNN using multi-step Leaky Integrate-and-Fire (LIF) neurons and attention mechanisms capable of processing spatio-temporal patterns over multiple time steps is designed. The architecture is further enhanced with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Modular Robots and Swarm Intelligence
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Dropout · Adam · Multi-Head Attention · Dense Connections · Layer Normalization · Spiking Neural Networks
