GINet: Graph Interaction Network for Scene Parsing

Tianyi Wu; Yu Lu; Yu Zhu; Chuang Zhang; Ming Wu; Zhanyu Ma; Guodong; Guo

arXiv:2009.06160·cs.CV·September 15, 2020·1 cites

GINet: Graph Interaction Network for Scene Parsing

Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, Ming Wu, Zhanyu Ma, Guodong, Guo

PDF

Open Access 1 Repo

TL;DR

GINet introduces a novel graph interaction mechanism and semantic loss to improve scene parsing by integrating linguistic knowledge, enhancing feature representations, and promoting semantic coherence in visual context reasoning.

Contribution

The paper proposes a Graph Interaction unit and Semantic Context Loss to incorporate linguistic knowledge into scene parsing, advancing context reasoning over image regions.

Findings

01

GINet outperforms state-of-the-art methods on Pascal-Context and COCO Stuff datasets.

02

The GI unit effectively enhances feature representations with high-level semantics.

03

SC-loss improves semantic coherence and discriminative capability.

Abstract

Recently, context reasoning using image regions beyond local convolution has shown great potential for scene parsing. In this work, we explore how to incorporate the linguistic knowledge to promote context reasoning over image regions by proposing a Graph Interaction unit (GI unit) and a Semantic Context Loss (SC-loss). The GI unit is capable of enhancing feature representations of convolution networks over high-level semantics and learning the semantic coherency adaptively to each sample. Specifically, the dataset-based linguistic knowledge is first incorporated in the GI unit to promote context reasoning over the visual graph, then the evolved representations of the visual graph are mapped to each local representation to enhance the discriminated capability for scene parsing. GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PaddlePaddle/PaddleSeg
paddleOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

MethodsConvolution