Fast Motion Understanding with Spatiotemporal Neural Networks and Dynamic Vision Sensors
Anthony Bisulco, Fernando Cladera Ojeda, Volkan Isler, Daniel D. Lee

TL;DR
This paper introduces a DVS-based neural network system that rapidly estimates high-speed object collision time and point, inspired by insect vision, enabling quick reactions in fast-motion scenarios.
Contribution
The paper presents a novel spatiotemporal neural network architecture that processes event-based data from DVS sensors for real-time high-speed motion understanding, outperforming traditional methods.
Findings
Achieved 24.73° angular error in collision point prediction.
Predicted collision time with 25.03% median error.
Successfully detected objects moving at 23.4 m/s.
Abstract
This paper presents a Dynamic Vision Sensor (DVS) based system for reasoning about high speed motion. As a representative scenario, we consider the case of a robot at rest reacting to a small, fast approaching object at speeds higher than 15m/s. Since conventional image sensors at typical frame rates observe such an object for only a few frames, estimating the underlying motion presents a considerable challenge for standard computer vision systems and algorithms. In this paper we present a method motivated by how animals such as insects solve this problem with their relatively simple vision systems. Our solution takes the event stream from a DVS and first encodes the temporal events with a set of causal exponential filters across multiple time scales. We couple these filters with a Convolutional Neural Network (CNN) to efficiently extract relevant spatiotemporal features. The combined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
