Graph Transformer for Graph-to-Sequence Learning
Deng Cai, Wai Lam

TL;DR
This paper introduces Graph Transformer, a novel model for graph-to-sequence learning that uses explicit relation encoding to enable direct communication between distant nodes, improving performance on text generation and translation tasks.
Contribution
The paper presents a new Graph Transformer model that enhances global graph structure modeling by allowing direct node communication, outperforming existing graph neural network approaches.
Findings
Achieved 27.4 BLEU on AMR-to-text generation, surpassing previous results.
Set new state-of-the-art BLEU scores on syntax-based translation tasks.
Demonstrated improved performance over existing models in multiple graph-to-sequence applications.
Abstract
The dominant graph-to-sequence transduction models employ graph neural networks for graph representation learning, where the structural information is reflected by the receptive field of neurons. Unlike graph neural networks that restrict the information exchange between immediate neighborhood, we propose a new model, known as Graph Transformer, that uses explicit relation encoding and allows direct communication between two distant nodes. It provides a more efficient way for global graph structure modeling. Experiments on the applications of text generation from Abstract Meaning Representation (AMR) and syntax-based neural machine translation show the superiority of our proposed model. Specifically, our model achieves 27.4 BLEU on LDC2015E86 and 29.7 BLEU on LDC2017T10 for AMR-to-text generation, outperforming the state-of-the-art results by up to 2.2 points. On the syntax-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Adam · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Multi-Head Attention · Byte Pair Encoding · Dense Connections
