TL;DR
This paper introduces HiPR, a novel camera-LiDAR occupancy framework that adaptively reparameterizes the projection space using height priors, improving scene understanding and outperforming existing methods.
Contribution
The paper proposes Height-Guided Projection Reparameterization (HiPR), which adaptively adjusts the projection space based on height priors to better capture scene geometry.
Findings
HiPR outperforms state-of-the-art methods in occupancy prediction.
The adaptive reparameterization improves feature aggregation accuracy.
The approach maintains real-time inference capabilities.
Abstract
3D occupancy prediction aims to infer dense, voxel-wise scene semantics from sensor observations, where the 2D-to-3D view transformation serves as a crucial step in bridging image features and volumetric representations. Most previous methods rely on a fixed projection space, where 3D reference points are uniformly sampled along pillars. However, such sampling struggles to capture the sparsity and height variations of real-world scenes, leading to ambiguous correspondences and unreliable feature aggregation. To address these challenges, we propose HiPR, a camera-LiDAR occupancy framework with Height-Guided Projection Reparameterization. HiPR first encodes LiDAR into a BEV height map to capture the maximum height of the point cloud. HiPR then adjusts the sampling range of each pillar using the height prior, enabling adaptive reparameterization of the projection space. As a result, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
