PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding
Yu-Cheng Hsieh, Cheng Sun, Suraj Dengale, Min Sun

TL;DR
PanoMixSwap is a novel data augmentation technique for indoor panoramic images that combines background, furniture, and layout from different images to enhance dataset diversity and improve scene understanding tasks.
Contribution
The paper introduces PanoMixSwap, a new method for mixing parts of indoor panoramic images to generate diverse training data for better scene understanding.
Findings
Improves semantic segmentation accuracy on indoor panoramas.
Enhances layout estimation performance.
State-of-the-art results achieved with augmented data.
Abstract
The volume and diversity of training data are critical for modern deep learningbased methods. Compared to the massive amount of labeled perspective images, 360 panoramic images fall short in both volume and diversity. In this paper, we propose PanoMixSwap, a novel data augmentation technique specifically designed for indoor panoramic images. PanoMixSwap explicitly mixes various background styles, foreground furniture, and room layouts from the existing indoor panorama datasets and generates a diverse set of new panoramic images to enrich the datasets. We first decompose each panoramic image into its constituent parts: background style, foreground furniture, and room layout. Then, we generate an augmented image by mixing these three parts from three different images, such as the foreground furniture from one image, the background style from another image, and the room structure from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Video Surveillance and Tracking Methods
