EFFOcc: Learning Efficient Occupancy Networks from Minimal Labels for Autonomous Driving
Yining Shi, Kun Jiang, Jinyu Miao, Ke Wang, Kangan Qian, Yunlong Wang,, Jiusi Li, Tuopu Wen, Mengmeng Yang, Yiliang Xu, Diange Yang

TL;DR
This paper introduces EFFOcc, an efficient framework for 3D occupancy prediction in autonomous driving that reduces model complexity and label requirements while maintaining state-of-the-art accuracy across multiple benchmarks.
Contribution
The paper presents a fusion-based OccNet using simple 2D operators and a multi-stage distillation method to achieve high accuracy with minimal labels and low computational cost.
Findings
Achieves 51.49 mIoU with 21.35M parameters on Occ3D-nuScenes.
Attains 94.38% of full-label performance with only 40% labeled data.
Outperforms existing methods in accuracy and efficiency on multiple benchmarks.
Abstract
3D occupancy prediction (3DOcc) is a rapidly rising and challenging perception task in the field of autonomous driving. Existing 3D occupancy networks (OccNets) are both computationally heavy and label-hungry. In terms of model complexity, OccNets are commonly composed of heavy Conv3D modules or transformers at the voxel level. Moreover, OccNets are supervised with expensive large-scale dense voxel labels. Model and data inefficiencies, caused by excessive network parameters and label annotation requirements, severely hinder the onboard deployment of OccNets. This paper proposes an EFFicient Occupancy learning framework, EFFOcc, that targets minimal network complexity and label requirements while achieving state-of-the-art accuracy. We first propose an efficient fusion-based OccNet that only uses simple 2D operators and improves accuracy to the state-of-the-art on three large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Video Surveillance and Tracking Methods · Automated Road and Building Extraction
