Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning
Alexander Jaus, Kailun Yang, Rainer Stiefelhagen

TL;DR
This paper introduces a panoramic panoptic segmentation framework that enables scene understanding for mobile agents using unsupervised contrastive learning, effectively transferring knowledge from standard images to panoramic images without needing annotated panoramic data.
Contribution
The work presents a novel domain adaptation method using dense contrastive learning for panoramic segmentation, along with the WildPPS dataset for advancing surrounding perception research.
Findings
Achieved 3.5-6.5% improvement in Panoptic Quality over non-adapted models.
Proposed a cost-effective domain transfer approach without target domain image access.
Published the WildPPS dataset to support panoramic scene understanding research.
Abstract
In this work, we introduce panoramic panoptic segmentation, as the most holistic scene understanding, both in terms of Field of View (FoV) and image-level understanding for standard camera-based input. A complete surrounding understanding provides a maximum of information to a mobile agent. This is essential information for any intelligent vehicle to make informed decisions in a safety-critical dynamic environment such as real-world traffic. In order to overcome the lack of annotated panoramic images, we propose a framework which allows model training on standard pinhole images and transfers the learned features to the panoramic domain in a cost-minimizing way. The domain shift from pinhole to panoramic images is non-trivial as large objects and surfaces are heavily distorted close to the image border regions and look different across the two domains. Using our proposed method with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
