TL;DR
SpikeStereoNet is a novel brain-inspired neural framework that estimates stereo depth directly from spike streams, outperforming existing methods and demonstrating high data efficiency on synthetic and real datasets.
Contribution
It introduces the first stereo depth estimation method from raw spike streams, along with new benchmarks and datasets for spike-based stereo vision.
Findings
Outperforms existing methods on synthetic and real datasets
Effectively captures subtle edges and intensity shifts in challenging regions
Maintains high accuracy with reduced training data
Abstract
Conventional frame-based cameras often struggle with stereo depth estimation in rapidly changing scenes. In contrast, bio-inspired spike cameras emit asynchronous events at microsecond-level resolution, providing an alternative sensing modality. However, existing methods lack specialized stereo algorithms and benchmarks tailored to the spike data. To address this gap, we propose SpikeStereoNet, a brain-inspired framework and the first to estimate stereo depth directly from raw spike streams. The model fuses raw spike streams from two viewpoints and iteratively refines depth estimation through a recurrent spiking neural network (RSNN) update module. To benchmark our approach, we introduce a large-scale synthetic spike stream dataset and a real-world stereo spike dataset with dense depth annotations. SpikeStereoNet outperforms existing methods on both datasets by leveraging spike streams'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
