Rethinking Event-Based Object Dtection through Representation-Level Temporal Aggregation and Model-Level Hypergraph Reasoning
Meisen Wang, Hao Deng, Wei Bao, Ma Yuanxiao, Chengjie Wang, Zhiqiang Tian, Shaoyi Du, Siqi Li

TL;DR
This paper introduces Ev-DTAD, a novel event-based object detection framework that combines explicit temporal encoding with hypergraph reasoning to improve detection accuracy and efficiency.
Contribution
The paper proposes a unified framework integrating Hierarchical Temporal Aggregation and Hypergraph Temporal Fusion for enhanced event-based object detection.
Findings
Ev-DTAD achieves higher mAP on multiple datasets (+0.8, +0.5, +3.0)
Ev-DTAD is 1.6 to 2.0 times faster than previous methods
The approach demonstrates a strong accuracy-efficiency trade-off
Abstract
Event cameras provide microsecond-level temporal resolution, low latency, and high dynamic range, offering potential for perception under fast motion and challenging illumination conditions. However, existing Event-based Object Detection (EOD) methods face limitations at both the representation and model levels: prior event representations usually encode temporal information indirectly through redundant structures, while detection models struggle to explicitly aggregate fragmented event responses into coherent high-order object features. To address these limitations, we present Event Dual Temporal-Relational Aggregation Detector (Ev-DTAD), a unified EOD framework that integrates representation-level temporal encoding with model-level temporal-hypergraph reasoning. Specifically, we introduce Hierarchical Temporal Aggregation (HTA), a compact three-channel pseudo-RGB representation that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
