RealNet: Combining Optimized Object Detection with Information Fusion Depth Estimation Co-Design Method on IoT
Zhuohao Li, Fandi Gou, Qixin De, Leqi Ding, Yuanhang Zhang, Yunze Cai

TL;DR
RealNet is a hybrid co-design approach that combines optimized object detection and depth estimation with information fusion on IoT devices, enhancing robustness and real-time performance for autonomous vehicles using monocular vision.
Contribution
The paper introduces a novel co-design method that fuses depth estimation and object detection algorithms to improve robustness and real-time performance on mobile platforms.
Findings
Achieved a prediction speed of 0.01s for object detection.
Effectively eliminated depth jitter and improved robustness.
Demonstrated suitability for high real-time requirements on IoT devices.
Abstract
Depth Estimation and Object Detection Recognition play an important role in autonomous driving technology under the guidance of deep learning artificial intelligence. We propose a hybrid structure called RealNet: a co-design method combining the model-streamlined recognition algorithm, the depth estimation algorithm with information fusion, and deploying them on the Jetson-Nano for unmanned vehicles with monocular vision sensors. We use ROS for experiment. The method proposed in this paper is suitable for mobile platforms with high real-time request. Innovation of our method is using information fusion to compensate the problem of insufficient frame rate of output image, and improve the robustness of target detection and depth estimation under monocular vision.Object Detection is based on YOLO-v5. We have simplified the network structure of its DarkNet53 and realized a prediction speed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Video Surveillance and Tracking Methods
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · PnP
