WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras
Yangjie Cui, Boyang Gao, Yiwei Zhang, Xin Dong, Jinwu Xiang, Daochun Li, Zhan Tu

TL;DR
WD-DETR introduces a wavelet denoising-enhanced transformer architecture for real-time object detection using event cameras, effectively reducing noise, improving detection accuracy, and achieving high frame rates suitable for robotic perception.
Contribution
The paper proposes a novel wavelet denoising method integrated into a transformer-based network for event camera data, enhancing real-time detection performance and robustness.
Findings
Outperforms state-of-the-art methods on three datasets.
Achieves approximately 35 FPS on NVIDIA Jetson Orin NX.
Effectively reduces noise in dense event representations.
Abstract
Previous studies on event camera sensing have demonstrated certain detection performance using dense event representations. However, the accumulated noise in such dense representations has received insufficient attention, which degrades the representation quality and increases the likelihood of missed detections. To address this challenge, we propose the Wavelet Denoising-enhanced DEtection TRansformer, i.e., WD-DETR network, for event cameras. In particular, a dense event representation is presented first, which enables real-time reconstruction of events as tensors. Then, a wavelet transform method is designed to filter noise in the event representations. Such a method is integrated into the backbone for feature extraction. The extracted features are subsequently fed into a transformer-based network for object prediction. To further reduce inference time, we incorporate the Dynamic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Age of Information Optimization
MethodsConvolution
