GET: Group Event Transformer for Event-Based Vision

Yansong Peng; Yueyi Zhang; Zhiwei Xiong; Xiaoyan Sun; Feng Wu

arXiv:2310.02642·cs.CV·October 5, 2023·5 cites

TL;DR

GET introduces a novel transformer backbone for event-based vision that effectively separates temporal-polarity from spatial information, leading to superior performance on classification and detection tasks.

Contribution

The paper proposes a new Group Event Transformer (GET) that decouples temporal-polarity information from spatial features using Group Tokens and dual self-attention, advancing event-based vision models.

Findings

01. GET outperforms state-of-the-art methods on multiple datasets.

02. The Group Token representation effectively captures asynchronous event information.

03. GET demonstrates versatility across classification and detection tasks.

Abstract

Event cameras are a type of novel neuromorphic sensor that has been gaining increasing attention. Existing event-based backbones mainly rely on image-based designs to extract spatial information within the image transformed from events, overlooking important event properties like time and polarity. To address this issue, we propose a novel Group-based vision Transformer backbone for Event-based vision, called Group Event Transformer (GET), which decouples temporal-polarity information from spatial information throughout the feature extraction process. Specifically, we first propose a new event representation for GET, named Group Token, which groups asynchronous events based on their timestamps and polarities. Then, GET applies the Event Dual Self-Attention block, and Group Token Aggregation module to facilitate effective feature communication and integration in both the spatial…
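The grouping step described in the abstract (events bucketed by timestamp and polarity) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `group_events`, the `(x, y, t, p)` column layout, the uniform time bins, the square patches, and the use of plain event counts as token features are all assumptions made for illustration.

```python
import numpy as np

def group_events(events, num_time_bins, height, width, patch):
    """Count events per (spatial patch, time-polarity group).

    Hypothetical helper, not from the paper. `events` is an (N, 4) array
    with columns x, y, t, p, where p is 0 or 1.
    Returns an array of shape (num_patches, 2 * num_time_bins).
    """
    x = events[:, 0].astype(np.int64)
    y = events[:, 1].astype(np.int64)
    t = events[:, 2].astype(np.float64)
    p = events[:, 3].astype(np.int64)

    # Discretize timestamps into uniform bins over the event window
    # (assumed binning scheme).
    span = max(t.max() - t.min(), 1e-9)
    t_bin = ((t - t.min()) / span * num_time_bins).astype(np.int64)
    t_bin = np.minimum(t_bin, num_time_bins - 1)

    # One group per (polarity, time bin) pair.
    group = p * num_time_bins + t_bin

    # Flat index of the patch each event falls into.
    patches_per_row = width // patch
    patch_idx = (y // patch) * patches_per_row + (x // patch)

    num_patches = (height // patch) * patches_per_row
    tokens = np.zeros((num_patches, 2 * num_time_bins), dtype=np.int64)
    # np.add.at accumulates correctly when indices repeat.
    np.add.at(tokens, (patch_idx, group), 1)
    return tokens

# Four events on a 4x4 sensor, 2x2 patches, 2 time bins.
events = np.array([
    [0, 0, 0.0, 1],
    [1, 1, 0.4, 1],
    [2, 0, 0.6, 0],
    [3, 3, 1.0, 0],
])
tokens = group_events(events, num_time_bins=2, height=4, width=4, patch=2)
print(tokens.shape)  # (4, 4): 4 patches, 4 time-polarity groups
```

Keeping the group axis separate from the patch axis is what lets later layers treat temporal-polarity information and spatial information independently. How GET actually embeds each group is described in the paper, not here.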

Code & Models

Datasets

Charlieqaq/DailyDVS-200
dataset · 21 dl

Taxonomy

Topics: Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · EEG and Brain-Computer Interfaces

Methods: Multi-Head Attention · Dense Connections · Vision Transformer · Linear Layer · Label Smoothing · Absolute Position Encodings · Attention Is All You Need · Adam · Residual Connection · Layer Normalization