Event Transformer. A sparse-aware solution for efficient event data processing
Alberto Sabater, Luis Montesano, Ana C. Murillo

TL;DR
Event Transformer (EvT) is a novel, efficient, and accurate framework for event data processing that leverages event-data properties with a patch-based representation and transformer architecture, suitable for low-resource environments.
Contribution
The paper introduces a new patch-based event representation and a compact transformer architecture, achieving high efficiency and accuracy for event data processing.
Findings
EvT achieves comparable or better accuracy than state-of-the-art methods.
EvT requires significantly less computation resources.
EvT operates with minimal latency on GPU and CPU.
Abstract
Event cameras are sensors of great interest for many applications that run in low-resource and challenging environments. They log sparse illumination changes with high temporal resolution and high dynamic range, while they present minimal power consumption. However, top-performing methods often ignore specific event-data properties, leading to the development of generic but computationally expensive algorithms. Efforts toward efficient solutions usually do not achieve top-accuracy results for complex tasks. This work proposes a novel framework, Event Transformer (EvT), that effectively takes advantage of event-data properties to be highly efficient and accurate. We introduce a new patch-based event representation and a compact transformer-like architecture to process it. EvT is evaluated on different event-based benchmarks for action and gesture recognition. Evaluation results show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Absolute Position Encodings · Label Smoothing · Softmax · Layer Normalization · Adam · Residual Connection
