Scene Graph Expansion for Semantics-Guided Image Outpainting
Chiao-An Yang, Cheng-Yo Tan, Wan-Cyuan Fan, Cheng-Fu Yang, Meng-Lin, Wu, Yu-Chiang Frank Wang

TL;DR
This paper introduces a novel scene graph transformer network for semantics-guided image outpainting, enabling the completion of images by understanding and expanding scene semantics at the graph level.
Contribution
The paper proposes a unique scene graph transformer that models structural information with attention at node and edge levels for improved image outpainting.
Findings
SGT effectively expands scene graphs for image completion.
Results outperform existing layout-to-image methods.
Demonstrated on MS-COCO and Visual Genome datasets.
Abstract
In this paper, we address the task of semantics-guided image outpainting, which is to complete an image by generating semantically practical content. Different from most existing image outpainting works, we approach the above task by understanding and completing image semantics at the scene graph level. In particular, we propose a novel network of Scene Graph Transformer (SGT), which is designed to take node and edge features as inputs for modeling the associated structural information. To better understand and process graph-based inputs, our SGT uniquely performs feature attention at both node and edge levels. While the former views edges as relationship regularization, the latter observes the co-occurrence of nodes for guiding the attention process. We demonstrate that, given a partial input image with its layout and scene graph, our SGT can be applied for scene graph expansion and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques · Visual Attention and Saliency Detection
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Laplacian EigenMap · Byte Pair Encoding · Absolute Position Encodings · Residual Connection · Laplacian Positional Encodings · Graph Transformer · Layer Normalization
