TL;DR
TFusionOcc introduces a novel T-primitive-based multi-sensor fusion framework that enhances 3D occupancy prediction for autonomous vehicles, effectively modeling complex structures and integrating camera and LiDAR data.
Contribution
The paper proposes a new T-primitive representation and a probabilistic fusion architecture for improved 3D occupancy prediction in autonomous driving.
Findings
Achieves state-of-the-art performance on nuScenes dataset.
Demonstrates robustness under various corruption scenarios.
Effectively models complex, non-convex structures with T-primitives.
Abstract
The prediction of 3D semantic occupancy enables autonomous vehicles (AVs) to perceive the fine-grained geometric and semantic scene structure for safe navigation and decision-making. Existing methods mainly rely on either voxel-based representations, which incur redundant computation over empty regions, or on object-centric Gaussian primitives, which are limited in modeling complex, non-convex, and asymmetric structures. In this paper, we present TFusionOcc, a T-primitive-based object-centric multi-sensor fusion framework for 3D semantic occupancy prediction. Specifically, we introduce a family of Students t-distribution-based T-primitives, including the plain T-primitive, T-Superquadric, and deformable T-Superquadric with inverse warping, where the deformable T-Superquadric serves as the key geometry-enhancing primitive. We further develop a unified probabilistic formulation based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
