Fast Occupancy Network
Mingjie Lu, Yuanxian Huang, Ji Liu, Xingliang Huang, Dong Li, Jinzhang, Peng, Lu Tian, and Emad Barsoum

TL;DR
This paper introduces a fast and efficient Occupancy Network for autonomous driving that improves inference speed and accuracy by using deformable 2D convolutions, a feature pyramid network, and a cost-free segmentation branch.
Contribution
The authors propose a simplified, faster Occupancy Network model with novel modules that significantly reduce computational cost while enhancing performance.
Findings
Outperforms existing methods in accuracy and speed
Surpasses SOTA OCCNet by 1.7% in accuracy
Achieves about 3X inference speedup
Abstract
Occupancy Network has recently attracted much attention in autonomous driving. Instead of monocular 3D detection and recent bird's eye view(BEV) models predicting 3D bounding box of obstacles, Occupancy Network predicts the category of voxel in specified 3D space around the ego vehicle via transforming 3D detection task into 3D voxel segmentation task, which has much superiority in tackling category outlier obstacles and providing fine-grained 3D representation. However, existing methods usually require huge computation resources than previous methods, which hinder the Occupancy Network solution applying in intelligent driving systems. To address this problem, we make an analysis of the bottleneck of Occupancy Network inference cost, and present a simple and fast Occupancy Network model, which adopts a deformable 2D convolutional layer to lift BEV feature to 3D voxel feature and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems
MethodsSoftmax · Attention Is All You Need
