QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction

Sicheng Zuo; Wenzhao Zheng; Xiaoyong Han; Longchao Yang; Yong Pan; Jiwen Lu

arXiv:2506.10977·cs.CV·June 13, 2025

QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction

Sicheng Zuo, Wenzhao Zheng, Xiaoyong Han, Longchao Yang, Yong Pan, Jiwen Lu

PDF

Open Access 1 Repo

TL;DR

QuadricFormer introduces a novel superquadric-based approach for 3D occupancy prediction, efficiently modeling complex scene structures with fewer primitives, outperforming existing methods in accuracy and efficiency.

Contribution

The paper proposes using superquadrics as scene primitives and develops a probabilistic mixture model, significantly improving efficiency and modeling capability over prior voxel and Gaussian-based methods.

Findings

01

Achieves state-of-the-art accuracy on nuScenes dataset.

02

Demonstrates superior efficiency with fewer primitives.

03

Outperforms voxel-based and Gaussian-based methods.

Abstract

3D occupancy prediction is crucial for robust autonomous driving systems as it enables comprehensive perception of environmental structures and semantics. Most existing methods employ dense voxel-based scene representations, ignoring the sparsity of driving scenes and resulting in inefficiency. Recent works explore object-centric representations based on sparse Gaussians, but their ellipsoidal shape prior limits the modeling of diverse structures. In real-world driving scenes, objects exhibit rich geometries (e.g., cuboids, cylinders, and irregular shapes), necessitating excessive ellipsoidal Gaussians densely packed for accurate modeling, which leads to inefficient representations. To address this, we propose to use geometrically expressive superquadrics as scene primitives, enabling efficient representation of complex structures with fewer primitives through their inherent shape…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zuosc19/quadricformer
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization · Advanced Vision and Imaging