Rethinking Event-Based Object Dtection through Representation-Level Temporal Aggregation and Model-Level Hypergraph Reasoning

Meisen Wang; Hao Deng; Wei Bao; Ma Yuanxiao; Chengjie Wang; Zhiqiang Tian; Shaoyi Du; Siqi Li

arXiv:2605.08825·cs.CV·May 15, 2026

Rethinking Event-Based Object Dtection through Representation-Level Temporal Aggregation and Model-Level Hypergraph Reasoning

Meisen Wang, Hao Deng, Wei Bao, Ma Yuanxiao, Chengjie Wang, Zhiqiang Tian, Shaoyi Du, Siqi Li

PDF

TL;DR

This paper introduces Ev-DTAD, a novel event-based object detection framework that combines explicit temporal encoding with hypergraph reasoning to improve detection accuracy and efficiency.

Contribution

The paper proposes a unified framework integrating Hierarchical Temporal Aggregation and Hypergraph Temporal Fusion for enhanced event-based object detection.

Findings

01

Ev-DTAD achieves higher mAP on multiple datasets (+0.8, +0.5, +3.0)

02

Ev-DTAD is 1.6 to 2.0 times faster than previous methods

03

The approach demonstrates a strong accuracy-efficiency trade-off

Abstract

Event cameras provide microsecond-level temporal resolution, low latency, and high dynamic range, offering potential for perception under fast motion and challenging illumination conditions. However, existing Event-based Object Detection (EOD) methods face limitations at both the representation and model levels: prior event representations usually encode temporal information indirectly through redundant structures, while detection models struggle to explicitly aggregate fragmented event responses into coherent high-order object features. To address these limitations, we present Event Dual Temporal-Relational Aggregation Detector (Ev-DTAD), a unified EOD framework that integrates representation-level temporal encoding with model-level temporal-hypergraph reasoning. Specifically, we introduce Hierarchical Temporal Aggregation (HTA), a compact three-channel pseudo-RGB representation that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.