Self-Supervised Event Representations: Towards Accurate, Real-Time Perception on SoC FPGAs
Kamil Jeziorek, Tomasz Kryjak

TL;DR
This paper presents a self-supervised recurrent method for encoding event camera data with high temporal fidelity, validated on FPGA hardware for real-time, power-efficient perception.
Contribution
It introduces a novel self-supervised GRU-based event representation method and demonstrates its hardware implementation on FPGA for the first time.
Findings
Outperforms aggregation-based baselines in object detection accuracy.
Achieves sub-microsecond latency on FPGA.
Maintains low power consumption between 1-2 W.
Abstract
Event cameras offer significant advantages over traditional frame-based sensors. These include microsecond temporal resolution, robustness under varying lighting conditions and low power consumption. Nevertheless, the effective processing of their sparse, asynchronous event streams remains challenging. Existing approaches to this problem can be categorised into two distinct groups. The first group involves the direct processing of event data with neural models, such as Spiking Neural Networks or Graph Convolutional Neural Networks. However, this approach is often accompanied by a compromise in terms of qualitative performance. The second group involves the conversion of events into dense representations with handcrafted aggregation functions, which can boost accuracy at the cost of temporal fidelity. This paper introduces a novel Self-Supervised Event Representation (SSER) method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
