Real-time Streaming Perception System for Autonomous Driving
Yongxiang Gu, Qianlei Wang, Xiaolin Qin

TL;DR
This paper introduces a real-time streaming perception system for autonomous driving that balances accuracy and latency, utilizing YOLOv5, data augmentation, and Transformers, achieving top-tier results in a competitive challenge.
Contribution
The paper presents a novel real-time streaming perception system that improves object detection performance for autonomous driving using enhanced YOLOv5-based methods with minimal inference cost.
Findings
Achieved 33.2 streaming AP on Argoverse-HD test set.
Significantly outperformed the baseline of 13.6 streaming AP.
Demonstrated effectiveness in a competitive challenge setting.
Abstract
Nowadays, plenty of deep learning technologies are being applied to all aspects of autonomous driving with promising results. Among them, object detection is the key to improve the ability of an autonomous agent to perceive its environment so that it can (re)act. However, previous vision-based object detectors cannot achieve satisfactory performance under real-time driving scenarios. To remedy this, we present the real-time steaming perception system in this paper, which is also the 2nd Place solution of Streaming Perception Challenge (Workshop on Autonomous Driving at CVPR 2021) for the detection-only track. Unlike traditional object detection challenges, which focus mainly on the absolute performance, streaming perception task requires achieving a balance of accuracy and latency, which is crucial for real-time autonomous driving. We adopt YOLOv5 as our basic framework, data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Residual Connection · Softmax · Dropout · Adam
