Event Transformer

Bin Jiang; Zhihao Li; M. Salman Asif; Xun Cao; Zhan Ma

arXiv:2204.05172·cs.CV·June 13, 2024·1 cites

Event Transformer

Bin Jiang, Zhihao Li, M. Salman Asif, Xun Cao, Zhan Ma

PDF

Open Access 1 Repo

TL;DR

This paper introduces Event Transformer, a novel event-based representation and attention mechanism that preserves detailed spatiotemporal information for vision tasks, achieving competitive results with minimal computational resources.

Contribution

It proposes a token-based event representation and a Three-way Attention mechanism, enhancing spatiotemporal correlation modeling in event-based vision tasks.

Findings

01

Competitive performance on object classification and optical flow estimation

02

Minimal computational resource requirements

03

Effective preservation of event sequence details

Abstract

The event camera's low power consumption and ability to capture microsecond brightness changes make it attractive for various computer vision tasks. Existing event representation methods typically convert events into frames, voxel grids, or spikes for deep neural networks (DNNs). However, these approaches often sacrifice temporal granularity or require specialized devices for processing. This work introduces a novel token-based event representation, where each event is considered a fundamental processing unit termed an event-token. This approach preserves the sequence's intricate spatiotemporal attributes at the event level. Moreover, we propose a Three-way Attention mechanism in the Event Transformer Block (ETB) to collaboratively construct temporal and spatial correlations between events. We compare our proposed token-based event representation extensively with other prevalent methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

njuvision/eventtransformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · EEG and Brain-Computer Interfaces · Neural dynamics and brain function

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections · Layer Normalization · Absolute Position Encodings · Softmax · Residual Connection