ESGN: Efficient Stereo Geometry Network for Fast 3D Object Detection
Aqi Gao, Yanwei Pang, Jing Nie, Jiale Cao, Yishun Guo

TL;DR
This paper introduces ESGN, a fast and efficient stereo geometry network that improves 3D object detection accuracy by generating geometry-aware features without complex aggregation, outperforming previous methods on KITTI.
Contribution
The paper proposes a novel efficient stereo geometry network with a geometry-aware feature generation module and a distillation scheme, enhancing accuracy without sacrificing speed.
Findings
ESGN outperforms YOLOStereo3D by 5.14% mAP on KITTI.
ESGN achieves a good balance between detection accuracy and inference speed.
The method effectively utilizes multi-scale stereo volumes and deep feature fusion.
Abstract
Fast stereo based 3D object detectors have made great progress recently. However, they lag far behind high-precision stereo based methods in accuracy. We argue that the main reason is due to the poor geometry-aware feature representation in 3D space. To solve this problem, we propose an efficient stereo geometry network (ESGN). The key in our ESGN is an efficient geometry-aware feature generation (EGFG) module. Our EGFG module first uses a stereo correlation and reprojection module to construct multi-scale stereo volumes in camera frustum space, second employs a multi-scale BEV projection and fusion module to generate multiple geometry-aware features. In these two steps, we adopt deep multi-scale information fusion for discriminative geometry-aware feature generation, without any complex aggregation networks. In addition, we introduce a deep geometry-aware feature distillation scheme to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
MethodsKnowledge Distillation
