TL;DR
GERD is a simulator that generates controlled event-based vision data with ground-truth transformations, facilitating geometric studies and model evaluation in event-based vision.
Contribution
It introduces GERD, a novel simulator for event-based vision data with precise geometric control and ground-truth, supporting noise models and sub-pixel motion.
Findings
GERD enables hypothesis-driven geometric studies in event-based vision.
The simulator supports multiple noise models and sub-pixel motion.
GERD is publicly available at github.com/ncskth/gerd.
Abstract
Event-based vision sensors offer high time resolution, high dynamic range, and low power consumption, yet event-based vision models lag behind conventional frame-based vision methods. We argue that this gap is partly due to the lack of principled study of the transformation groups that govern event-based visual streams. Motivated by the role that geometric and group-theoretic methods have played in advancing computer vision, we present GERD: a simulator for generating event-based recordings of objects under precisely controlled affine, Galilean, and temporal scaling transformations. By providing ground-truth transformations at each timestep, GERD enables hypothesis-driven and controlled studies of geometric properties that are otherwise impossible to isolate in real-world datasets. The simulator supports three noise models and sub-pixel motion as a complement to real sensor datasets. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
