Learning to Detect Objects with a 1 Megapixel Event Camera
Etienne Perot, Pierre de Tournemire, Davide Nitti, Jonathan Masci,, Amos Sironi

TL;DR
This paper introduces a high-resolution dataset and a novel recurrent deep learning architecture for event-based object detection, demonstrating that direct training on raw event data can match traditional frame-based methods in accuracy.
Contribution
The authors release the first large-scale high-resolution dataset for event-based detection and propose a recurrent model with a temporal consistency loss for improved performance.
Findings
The model outperforms feed-forward event-based architectures.
Training directly on raw events is more efficient and accurate.
Performance is comparable to traditional frame-based detectors.
Abstract
Event cameras encode visual information with high temporal precision, low data-rate, and high-dynamic range. Thanks to these characteristics, event cameras are particularly suited for scenarios with high motion, challenging lighting conditions and requiring low latency. However, due to the novelty of the field, the performance of event-based systems on many vision tasks is still lower compared to conventional frame-based solutions. The main reasons for this performance gap are: the lower spatial resolution of event sensors, compared to frame cameras; the lack of large-scale training datasets; the absence of well established deep learning architectures for event-based processing. In this paper, we address all these problems in the context of an event-based object detection task. First, we publicly release the first high-resolution large-scale dataset for object detection. The dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Machine Learning and ELM
