Towards Accurate Single Panoramic 3D Detection: A Semantic Gaussian Centric Approach

Kanglin Ning; Yiran Zhao; Wenrui Li; Shaoru Sun; Xingtao Wang; Xiaopeng Fan

arXiv:2605.14601·cs.CV·May 15, 2026

Towards Accurate Single Panoramic 3D Detection: A Semantic Gaussian Centric Approach

Kanglin Ning, Yiran Zhao, Wenrui Li, Shaoru Sun, Xingtao Wang, Xiaopeng Fan

PDF

TL;DR

This paper introduces PanoGSDet, a novel panoramic 3D detection framework using continuous semantic Gaussian representations to improve accuracy over existing grid-based methods.

Contribution

It proposes a monocular panoramic 3D detection approach that models features with continuous semantic Gaussians, enhancing geometric continuity and representation efficiency.

Findings

01

Outperforms existing methods on Structured3D dataset

02

Effectively models spherical features with semantic Gaussians

03

Refines 3D bounding boxes through Gaussian optimization

Abstract

Three-dimensional object detection in panoramic imagery is crucial for comprehensive scene understanding, yet accurately mapping 2D features to 3D remains a significant challenge. Prevailing methods often project 2D features onto discrete 3D grids, which break geometric continuity and limit representation efficiency. To overcome this limitation, this paper proposes PanoGSDet, a monocular panoramic 3D detection framework built upon continuous semantic 3D Gaussian representations. The proposed framework comprises a panoramic depth estimation component and a semantic Gaussian component. The panoramic depth estimation component extracts the equirectangular semantic and depth features from the monocular panorama input. The semantic Gaussian component includes a semantic Gaussian lifting module that projects spherical features into 3D semantic Gaussians, a semantic Gaussian optimization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.