Panoptic-aware Image-to-Image Translation

Liyun Zhang; Photchara Ratsamee; Bowen Wang; Zhaojie Luo; Yuki; Uranishi; Manabu Higashida; Haruo Takemura

arXiv:2112.01926·cs.CV·December 26, 2022

Panoptic-aware Image-to-Image Translation

Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki, Uranishi, Manabu Higashida, Haruo Takemura

PDF

Open Access 1 Video

TL;DR

This paper introduces PanopticGAN, a novel image-to-image translation method that leverages panoptic perception to improve fidelity and object recognition in complex scenes by aligning object content with style codes.

Contribution

The paper proposes a panoptic-aware GAN framework that incorporates panoptic segmentation to enhance image translation quality and object detail preservation.

Findings

01

Significant improvement in image quality over existing methods

02

Enhanced object recognition performance in translated images

03

Effective alignment of object content and style codes

Abstract

Despite remarkable progress in image translation, the complex scene with multiple discrepant objects remains a challenging problem. The translated images have low fidelity and tiny objects in fewer details causing unsatisfactory performance in object recognition. Without thorough object perception (i.e., bounding boxes, categories, and masks) of images as prior knowledge, the style transformation of each object will be difficult to track in translation. We propose panoptic-aware generative adversarial networks (PanopticGAN) for image-to-image translation together with a compact panoptic segmentation dataset. The panoptic perception (i.e., foreground instances and background semantics of the image scene) is extracted to achieve alignment between object content codes of the input domain and panoptic-level style codes sampled from the target style space, then refined by a proposed feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Panoptic-aware Image-to-Image Translation· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Cancer-related molecular mechanisms research