Panoptic-aware Image-to-Image Translation
Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki, Uranishi, Manabu Higashida, Haruo Takemura

TL;DR
This paper introduces PanopticGAN, a novel image-to-image translation method that leverages panoptic perception to improve fidelity and object recognition in complex scenes by aligning object content with style codes.
Contribution
The paper proposes a panoptic-aware GAN framework that incorporates panoptic segmentation to enhance image translation quality and object detail preservation.
Findings
Significant improvement in image quality over existing methods
Enhanced object recognition performance in translated images
Effective alignment of object content and style codes
Abstract
Despite remarkable progress in image translation, the complex scene with multiple discrepant objects remains a challenging problem. The translated images have low fidelity and tiny objects in fewer details causing unsatisfactory performance in object recognition. Without thorough object perception (i.e., bounding boxes, categories, and masks) of images as prior knowledge, the style transformation of each object will be difficult to track in translation. We propose panoptic-aware generative adversarial networks (PanopticGAN) for image-to-image translation together with a compact panoptic segmentation dataset. The panoptic perception (i.e., foreground instances and background semantics of the image scene) is extracted to achieve alignment between object content codes of the input domain and panoptic-level style codes sampled from the target style space, then refined by a proposed feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Panoptic-aware Image-to-Image Translation· youtube
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Cancer-related molecular mechanisms research
