EventAug: Multifaceted Spatio-Temporal Data Augmentation Methods for Event-based Learning
Yukun Tian, Hao Chen, Yongjian Deng, Feihong Shen, Kepan Liu, Wei You,, Ziyang Zhang

TL;DR
EventAug introduces novel spatio-temporal data augmentation techniques for event-based learning, significantly enhancing model robustness and accuracy across various tasks by enriching motion diversity and object variants.
Contribution
The paper presents a systematic augmentation scheme, EventAug, with methods like MSTI, SSEM, and TSEM to improve event data diversity and model performance.
Findings
Achieved a 4.87% accuracy gain on DVS128 Gesture
Enhanced model robustness to occlusions and varied speeds
Consistently improved performance across different tasks and backbones
Abstract
The event camera has demonstrated significant success across a wide range of areas due to its low time latency and high dynamic range. However, the community faces challenges such as data deficiency and limited diversity, often resulting in over-fitting and inadequate feature learning. Notably, the exploration of data augmentation techniques in the event community remains scarce. This work aims to address this gap by introducing a systematic augmentation scheme named EventAug to enrich spatial-temporal diversity. In particular, we first propose Multi-scale Temporal Integration (MSTI) to diversify the motion speed of objects, then introduce Spatial-salient Event Mask (SSEM) and Temporal-salient Event Mask (TSEM) to enrich object variants. Our EventAug can facilitate models learning with richer motion patterns, object variants and local spatio-temporal relations, thus improving model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Data Stream Mining Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
