EventAug: Multifaceted Spatio-Temporal Data Augmentation Methods for   Event-based Learning

Yukun Tian; Hao Chen; Yongjian Deng; Feihong Shen; Kepan Liu; Wei You,; Ziyang Zhang

arXiv:2409.11813·cs.CV·September 19, 2024

EventAug: Multifaceted Spatio-Temporal Data Augmentation Methods for Event-based Learning

Yukun Tian, Hao Chen, Yongjian Deng, Feihong Shen, Kepan Liu, Wei You,, Ziyang Zhang

PDF

Open Access

TL;DR

EventAug introduces novel spatio-temporal data augmentation techniques for event-based learning, significantly enhancing model robustness and accuracy across various tasks by enriching motion diversity and object variants.

Contribution

The paper presents a systematic augmentation scheme, EventAug, with methods like MSTI, SSEM, and TSEM to improve event data diversity and model performance.

Findings

01

Achieved a 4.87% accuracy gain on DVS128 Gesture

02

Enhanced model robustness to occlusions and varied speeds

03

Consistently improved performance across different tasks and backbones

Abstract

The event camera has demonstrated significant success across a wide range of areas due to its low time latency and high dynamic range. However, the community faces challenges such as data deficiency and limited diversity, often resulting in over-fitting and inadequate feature learning. Notably, the exploration of data augmentation techniques in the event community remains scarce. This work aims to address this gap by introducing a systematic augmentation scheme named EventAug to enrich spatial-temporal diversity. In particular, we first propose Multi-scale Temporal Integration (MSTI) to diversify the motion speed of objects, then introduce Spatial-salient Event Mask (SSEM) and Temporal-salient Event Mask (TSEM) to enrich object variants. Our EventAug can facilitate models learning with richer motion patterns, object variants and local spatio-temporal relations, thus improving model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Management and Algorithms · Data Stream Mining Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings