PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive   Summarization

Xinbei Ma; Yeyun Gong; Pengcheng He; Hai Zhao; Nan Duan

arXiv:2305.06647·cs.CL·February 29, 2024·1 cites

PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization

Xinbei Ma, Yeyun Gong, Pengcheng He, Hai Zhao, Nan Duan

PDF

Open Access 1 Repo

TL;DR

PROM introduces a phrase-level copying mechanism with an indicator layer and auxiliary loss, significantly improving abstractive summarization performance, especially in zero-shot settings, by enhancing copying accuracy and factuality.

Contribution

The paper proposes PROM, a novel phrase-level copying mechanism with explicit n-gram attention and auxiliary loss, applicable to zero-shot and fine-tuning scenarios, advancing summarization quality.

Findings

01

Significant improvements on benchmark datasets.

02

Effective in zero-shot summarization with pre-training.

03

Promotes more accurate and faithful copying.

Abstract

Based on the remarkable achievements of pre-trained language models in abstractive summarization, the copying mechanism has proved helpful by improving the factuality, stability, and overall performance. This work proposes PROM, a new PhRase-level cOpying Mechanism that enhances attention on n-grams, which can be applied to zero-shot summarization with pre-training. PROM adds an indicator layer to explicitly pick up tokens in n-gram that can be copied from the source, and calculates an auxiliary loss for the copying prediction. Empirical studies show that PROM makes significant improvements in fine-tuning on benchmarks. In zero-shot setting, PROM is utilized in the self-supervised pre-training on raw corpora and provides new general baselines on a wide range of summarization datasets. Further analysis shows that PROM performs more reasonable copying and contributes to faithfulness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xbmxb/prom
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification