Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization
Beliz Gunel, Chenguang Zhu, Michael Zeng, Xuedong Huang

TL;DR
This paper introduces a knowledge-boosted Transformer model that incorporates Wikidata facts and long-range coherence techniques to improve the factual accuracy and coherence of abstractive summaries, especially for lengthy articles.
Contribution
The paper presents a novel architecture combining entity knowledge injection from Wikidata and Transformer-XL ideas to enhance factual correctness and coherence in abstractive summarization.
Findings
Improved ROUGE scores over baseline Transformer models.
Model accurately conveys facts better than baseline.
Enhanced coherence in long article summaries.
Abstract
Neural models have become successful at producing abstractive summaries that are human-readable and fluent. However, these models have two critical shortcomings: they often don't respect the facts that are either included in the source article or are known to humans as commonsense knowledge, and they don't produce coherent summaries when the source article is long. In this work, we propose a novel architecture that extends Transformer encoder-decoder architecture in order to improve on these shortcomings. First, we incorporate entity-level knowledge from the Wikidata knowledge graph into the encoder-decoder architecture. Injecting structural world knowledge from Wikidata helps our abstractive summarization model to be more fact-aware. Second, we utilize the ideas used in Transformer-XL language model in our proposed encoder-decoder architecture. This helps our model with producing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Cosine Annealing · Residual Connection · Label Smoothing · Multi-Head Attention · Variational Dropout · Adam · *Communicated@Fast*How Do I Communicate to Expedia?
