PanSt3R: Multi-view Consistent Panoptic Segmentation

Lojze Zust; Yohann Cabon; Juliette Marrie; Leonid Antsfeld; Boris Chidlovskii; Jerome Revaud; Gabriela Csurka

arXiv:2506.21348·cs.CV·June 27, 2025

PanSt3R: Multi-view Consistent Panoptic Segmentation

Lojze Zust, Yohann Cabon, Juliette Marrie, Leonid Antsfeld, Boris Chidlovskii, Jerome Revaud, Gabriela Csurka

PDF

Open Access

TL;DR

PanSt3R is a fast, scalable 3D panoptic segmentation method that jointly predicts geometry and segmentation without test-time optimization, outperforming existing approaches on multiple benchmarks.

Contribution

It introduces PanSt3R, a unified approach that eliminates test-time optimization and enhances multi-view 3D panoptic segmentation with semantic awareness.

Findings

01

Achieves state-of-the-art performance on benchmarks.

02

Runs significantly faster than existing methods.

03

Provides effective novel-view prediction capabilities.

Abstract

Panoptic segmentation of 3D scenes, involving the segmentation and classification of object instances in a dense 3D reconstruction of a scene, is a challenging problem, especially when relying solely on unposed 2D images. Existing approaches typically leverage off-the-shelf models to extract per-frame 2D panoptic segmentations, before optimizing an implicit geometric representation (often based on NeRF) to integrate and fuse the 2D predictions. We argue that relying on 2D panoptic segmentation for a problem inherently 3D and multi-view is likely suboptimal as it fails to leverage the full potential of spatial relationships across views. In addition to requiring camera parameters, these approaches also necessitate computationally expensive test-time optimization for each scene. Instead, in this work, we propose a unified and integrated approach PanSt3R, which eliminates the need for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques