PanORama: Multiview Consistent Panoptic Segmentation in Operating Rooms

Tuna Gürbüz; Ege Özsoy; Tony Danjun Wang; Nassir Navab

arXiv:2603.19920·cs.CV·March 23, 2026

TL;DR

PanORama is a multiview-consistent panoptic segmentation method for operating rooms. It improves spatial understanding without requiring camera calibration and outperforms previous methods on the MM-OR and 4D-OR datasets.

Contribution

PanORama is the first method to achieve multiview-consistent panoptic segmentation in ORs. It models cross-view interactions at the feature level inside the backbone, in a single forward pass, rather than through post-hoc refinement.
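
To make the idea of feature-level cross-view interaction concrete, here is a minimal sketch of one common way to let views exchange information: joint self-attention over the tokens of all views. This is an illustration under stated assumptions, not the PanORama architecture; the source does not specify the mechanism, and the function name `cross_view_attention`, the projection matrices, and the tensor shapes are all hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_view_attention(feats, w_q, w_k, w_v):
    """Joint self-attention over tokens from all views (hypothetical sketch).

    feats: array of shape (V, N, D) holding N feature tokens of dimension D
           for each of V camera views.
    Returns an array of shape (V, N, D) in which every token has attended
    to the tokens of every view. No camera parameters are used.
    """
    num_views, num_tokens, dim = feats.shape
    # Pool the tokens of all views into one sequence so attention spans views.
    tokens = feats.reshape(num_views * num_tokens, dim)
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    attn = softmax(q @ k.T / np.sqrt(k.shape[-1]), axis=-1)
    fused = attn @ v
    return fused.reshape(num_views, num_tokens, -1)


rng = np.random.default_rng(0)
V, N, D = 3, 4, 8  # assumed toy sizes: 3 views, 4 tokens per view, 8 channels
feats = rng.standard_normal((V, N, D))
w_q, w_k, w_v = (rng.standard_normal((D, D)) for _ in range(3))
fused = cross_view_attention(feats, w_q, w_k, w_v)
```

Because all tokens share one attention map, changing the features of one view changes the output for every other view, which is the kind of coupling that feature-level cross-view modeling relies on. The sketch takes only image features as input, which matches the calibration-free setting in spirit.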

Findings

01. Achieves over 70% Panoptic Quality (PQ) on the MM-OR and 4D-OR datasets.

02. Outperforms previous state-of-the-art methods.

03. Is calibration-free and generalizes to unseen viewpoints.
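
For readers unfamiliar with the metric, Panoptic Quality for a single class is commonly defined as the summed IoU of matched segment pairs divided by |TP| + 0.5|FP| + 0.5|FN|, where a predicted and a ground-truth segment match when their IoU exceeds 0.5. The sketch below computes it on toy data. It assumes segments are given as sets of pixel coordinates and do not overlap within a map; the helper names are illustrative and this is not the evaluation code used in the paper. In practice, PQ is computed per class and then averaged over classes.

```python
from typing import Dict, Set, Tuple

Segment = Set[Tuple[int, int]]  # a segment as a set of (row, col) pixels


def iou(a: Segment, b: Segment) -> float:
    """Intersection over union of two pixel sets."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def panoptic_quality(pred: Dict[int, Segment], gt: Dict[int, Segment]) -> float:
    """PQ for one class: sum of matched IoUs / (TP + 0.5*FP + 0.5*FN)."""
    matched_pred = set()
    iou_sum = 0.0
    tp = 0
    for g in gt.values():
        for p_id, p in pred.items():
            if p_id in matched_pred:
                continue
            score = iou(p, g)
            # With non-overlapping segments, IoU > 0.5 makes the match unique.
            if score > 0.5:
                matched_pred.add(p_id)
                iou_sum += score
                tp += 1
                break
    fp = len(pred) - tp  # predicted segments with no match
    fn = len(gt) - tp    # ground-truth segments with no match
    denom = tp + 0.5 * fp + 0.5 * fn
    return iou_sum / denom if denom else 0.0


# Toy example: one good match (IoU 0.75), one missed segment, one spurious one.
gt = {1: {(0, 0), (0, 1), (0, 2), (0, 3)}, 2: {(1, 0), (1, 1)}}
pred = {10: {(0, 0), (0, 1), (0, 2)}, 11: {(2, 0)}}
pq = panoptic_quality(pred, gt)  # 0.75 / (1 + 0.5 + 0.5) = 0.375
```

The toy case shows why PQ penalizes both segmentation and recognition errors: the matched pair contributes its IoU to the numerator, while the missed and spurious segments each add 0.5 to the denominator.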

Abstract

Operating rooms (ORs) are cluttered, dynamic, highly occluded environments, where reliable spatial understanding is essential for situational awareness during complex surgical workflows. Achieving spatial understanding for panoptic segmentation from sparse multiview images poses a fundamental challenge, as limited visibility in a subset of views often leads to mispredictions across cameras. To this end, we introduce PanORama, the first panoptic segmentation for the operating room that is multiview-consistent by design. By modeling cross-view interactions at the feature level inside the backbone in a single forward pass, view consistency emerges directly rather than through post-hoc refinement. We evaluate on the MM-OR and 4D-OR datasets, achieving >70% Panoptic Quality (PQ) performance, and outperforming the previous state of the art. Importantly, PanORama is calibration-free, requiring…

Taxonomy

Topics: Advanced Vision and Imaging · Robotics and Sensor-Based Localization · Surgical Simulation and Training