Learning to Detect Objects with a 1 Megapixel Event Camera

Etienne Perot; Pierre de Tournemire; Davide Nitti; Jonathan Masci,; Amos Sironi

arXiv:2009.13436·cs.CV·December 10, 2020·142 cites

Learning to Detect Objects with a 1 Megapixel Event Camera

Etienne Perot, Pierre de Tournemire, Davide Nitti, Jonathan Masci,, Amos Sironi

PDF

Open Access 1 Video

TL;DR

This paper introduces a high-resolution dataset and a novel recurrent deep learning architecture for event-based object detection, demonstrating that direct training on raw event data can match traditional frame-based methods in accuracy.

Contribution

The authors release the first large-scale high-resolution dataset for event-based detection and propose a recurrent model with a temporal consistency loss for improved performance.

Findings

01

The model outperforms feed-forward event-based architectures.

02

Training directly on raw events is more efficient and accurate.

03

Performance is comparable to traditional frame-based detectors.

Abstract

Event cameras encode visual information with high temporal precision, low data-rate, and high-dynamic range. Thanks to these characteristics, event cameras are particularly suited for scenarios with high motion, challenging lighting conditions and requiring low latency. However, due to the novelty of the field, the performance of event-based systems on many vision tasks is still lower compared to conventional frame-based solutions. The main reasons for this performance gap are: the lower spatial resolution of event sensors, compared to frame cameras; the lack of large-scale training datasets; the absence of well established deep learning architectures for event-based processing. In this paper, we address all these problems in the context of an event-based object detection task. First, we publicly release the first high-resolution large-scale dataset for object detection. The dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning to Detect Objects with a 1 Megapixel Event Camera· slideslive

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Machine Learning and ELM