YCB-Ev SD: Synthetic event-vision dataset for 6DoF object pose estimation
Pavel Rojtberg, Julius K\"uhn

TL;DR
This paper introduces YCB-Ev SD, a large synthetic event-camera dataset for 6DoF object pose estimation, with comprehensive evaluation of event representations for CNN inference, advancing event-based vision research.
Contribution
The paper provides the first large-scale synthetic event dataset for 6DoF pose estimation and systematically evaluates event representations, highlighting the importance of polarity and linear decay.
Findings
Time-surfaces with linear decay outperform exponential decay.
Polarity information significantly improves pose estimation.
Linear temporal encoding better preserves motion details.
Abstract
We introduce YCB-Ev SD, a synthetic dataset of event-camera data at standard definition (SD) resolution for 6DoF object pose estimation. While synthetic data has become fundamental in frame-based computer vision, event-based vision lacks comparable comprehensive resources. Addressing this gap, we present 50,000 event sequences of 34 ms duration each, synthesized from Physically Based Rendering (PBR) scenes of YCB-Video objects following the Benchmark for 6D Object Pose (BOP) methodology. Our generation framework employs simulated linear camera motion to ensure complete scene coverage, including background activity. Through systematic evaluation of event representations for CNN-based inference, we demonstrate that time-surfaces with linear decay and dual-channel polarity encoding achieve superior pose estimation performance, outperforming exponential decay and single-channel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Human Pose and Action Recognition · Robotics and Sensor-Based Localization
