A Recurrent YOLOv8-based framework for Event-Based Object Detection

Diego A. Silva; Kamilya Smagulova; Ahmed Elsheikh; Mohammed E. Fouda,; Ahmed M. Eltawil

arXiv:2408.05321·cs.CV·August 13, 2024

A Recurrent YOLOv8-based framework for Event-Based Object Detection

Diego A. Silva, Kamilya Smagulova, Ahmed Elsheikh, Mohammed E. Fouda,, Ahmed M. Eltawil

PDF

Open Access

TL;DR

This paper presents ReYOLOv8, a novel event-based object detection framework that improves accuracy and efficiency in challenging environments by leveraging spatiotemporal modeling and specialized data augmentation techniques.

Contribution

The study introduces ReYOLOv8, integrating event data encoding and augmentation methods to enhance detection performance and reduce model complexity in real-time applications.

Findings

01

Outperformed comparable approaches on GEN1 dataset with up to 5% mAP improvement.

02

Achieved significant model size reduction of up to 14.5x while maintaining accuracy.

03

Enhanced detection speed, processing between 9.2ms and 15.5ms per frame.

Abstract

Object detection is crucial in various cutting-edge applications, such as autonomous vehicles and advanced robotics systems, primarily relying on data from conventional frame-based RGB sensors. However, these sensors often struggle with issues like motion blur and poor performance in challenging lighting conditions. In response to these challenges, event-based cameras have emerged as an innovative paradigm. These cameras, mimicking the human eye, demonstrate superior performance in environments with fast motion and extreme lighting conditions while consuming less power. This study introduces ReYOLOv8, an advanced object detection framework that enhances a leading frame-based detection system with spatiotemporal modeling capabilities. We implemented a low-latency, memory-efficient method for encoding event data to boost the system's performance. We also developed a novel data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Infrared Target Detection Methodologies

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings