Mind The Facts: Knowledge-Boosted Coherent Abstractive Text   Summarization

Beliz Gunel; Chenguang Zhu; Michael Zeng; Xuedong Huang

arXiv:2006.15435·cs.CL·June 30, 2020·27 cites

Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization

Beliz Gunel, Chenguang Zhu, Michael Zeng, Xuedong Huang

PDF

Open Access

TL;DR

This paper introduces a knowledge-boosted Transformer model that incorporates Wikidata facts and long-range coherence techniques to improve the factual accuracy and coherence of abstractive summaries, especially for lengthy articles.

Contribution

The paper presents a novel architecture combining entity knowledge injection from Wikidata and Transformer-XL ideas to enhance factual correctness and coherence in abstractive summarization.

Findings

01

Improved ROUGE scores over baseline Transformer models.

02

Model accurately conveys facts better than baseline.

03

Enhanced coherence in long article summaries.

Abstract

Neural models have become successful at producing abstractive summaries that are human-readable and fluent. However, these models have two critical shortcomings: they often don't respect the facts that are either included in the source article or are known to humans as commonsense knowledge, and they don't produce coherent summaries when the source article is long. In this work, we propose a novel architecture that extends Transformer encoder-decoder architecture in order to improve on these shortcomings. First, we incorporate entity-level knowledge from the Wikidata knowledge graph into the encoder-decoder architecture. Injecting structural world knowledge from Wikidata helps our abstractive summarization model to be more fact-aware. Second, we utilize the ideas used in Transformer-XL language model in our proposed encoder-decoder architecture. This helps our model with producing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Cosine Annealing · Residual Connection · Label Smoothing · Multi-Head Attention · Variational Dropout · Adam · *Communicated@Fast*How Do I Communicate to Expedia?