Energy-Efficient Deep Reinforcement Learning with Spiking Transformers

Mohammad Irfan Uddin; Nishad Tasnim; Md Omor Faruk; Zejian Zhou

arXiv:2505.14533·cs.LG·May 21, 2025

Energy-Efficient Deep Reinforcement Learning with Spiking Transformers

Mohammad Irfan Uddin, Nishad Tasnim, Md Omor Faruk, Zejian Zhou

PDF

Open Access

TL;DR

This paper introduces a Spike-Transformer Reinforcement Learning algorithm that combines energy-efficient spiking neural networks with Transformer-like decision-making, achieving high performance with lower energy consumption in complex tasks.

Contribution

The paper presents a novel SNN-based Transformer architecture for reinforcement learning that improves energy efficiency while maintaining high policy performance.

Findings

01

Significantly improved policy performance over traditional Transformers.

02

Enhanced energy efficiency demonstrated in numerical experiments.

03

Effective processing of spatio-temporal patterns over multiple time steps.

Abstract

Agent-based Transformers have been widely adopted in recent reinforcement learning advances due to their demonstrated ability to solve complex tasks. However, the high computational complexity of Transformers often results in significant energy consumption, limiting their deployment in real-world autonomous systems. Spiking neural networks (SNNs), with their biologically inspired structure, offer an energy-efficient alternative for machine learning. In this paper, a novel Spike-Transformer Reinforcement Learning (STRL) algorithm that combines the energy efficiency of SNNs with the powerful decision-making capabilities of reinforcement learning is developed. Specifically, an SNN using multi-step Leaky Integrate-and-Fire (LIF) neurons and attention mechanisms capable of processing spatio-temporal patterns over multiple time steps is designed. The architecture is further enhanced with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Modular Robots and Swarm Intelligence

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Dropout · Adam · Multi-Head Attention · Dense Connections · Layer Normalization · Spiking Neural Networks