Scene Graph Expansion for Semantics-Guided Image Outpainting

Chiao-An Yang; Cheng-Yo Tan; Wan-Cyuan Fan; Cheng-Fu Yang; Meng-Lin; Wu; Yu-Chiang Frank Wang

arXiv:2205.02958·cs.CV·May 9, 2022·1 cites

Scene Graph Expansion for Semantics-Guided Image Outpainting

Chiao-An Yang, Cheng-Yo Tan, Wan-Cyuan Fan, Cheng-Fu Yang, Meng-Lin, Wu, Yu-Chiang Frank Wang

PDF

Open Access

TL;DR

This paper introduces a novel scene graph transformer network for semantics-guided image outpainting, enabling the completion of images by understanding and expanding scene semantics at the graph level.

Contribution

The paper proposes a unique scene graph transformer that models structural information with attention at node and edge levels for improved image outpainting.

Findings

01

SGT effectively expands scene graphs for image completion.

02

Results outperform existing layout-to-image methods.

03

Demonstrated on MS-COCO and Visual Genome datasets.

Abstract

In this paper, we address the task of semantics-guided image outpainting, which is to complete an image by generating semantically practical content. Different from most existing image outpainting works, we approach the above task by understanding and completing image semantics at the scene graph level. In particular, we propose a novel network of Scene Graph Transformer (SGT), which is designed to take node and edge features as inputs for modeling the associated structural information. To better understand and process graph-based inputs, our SGT uniquely performs feature attention at both node and edge levels. While the former views edges as relationship regularization, the latter observes the co-occurrence of nodes for guiding the attention process. We demonstrate that, given a partial input image with its layout and scene graph, our SGT can be applied for scene graph expansion and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques · Visual Attention and Saliency Detection

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Laplacian EigenMap · Byte Pair Encoding · Absolute Position Encodings · Residual Connection · Laplacian Positional Encodings · Graph Transformer · Layer Normalization