Abstractive Summarization for Low Resource Data using Domain Transfer and Data Synthesis

Ahmed Magooda; Diane Litman

arXiv:2002.03407·cs.CL·February 11, 2020·6 cites

TL;DR

This paper enhances abstractive summarization for low-resource domains by combining domain transfer and data synthesis, leading to improved ROUGE scores and more coherent summaries in student reflection data.

Contribution

It combines domain transfer and data synthesis to improve abstractive summarization in low-resource settings, reporting higher ROUGE scores than models trained on target data alone.

Findings

01. Tuned models outperform models trained only on target data

02. Data synthesis further improves ROUGE scores

03. Combining domain transfer and data synthesis yields the best results

Abstract

Training abstractive summarization models typically requires large amounts of data, which can be a limitation for many domains. In this paper we explore using domain transfer and data synthesis to improve the performance of recent abstractive summarization methods when applied to small corpora of student reflections. First, we explored whether tuning a state-of-the-art model trained on newspaper data could boost performance on student reflection data. Evaluations demonstrated that summaries produced by the tuned model achieved higher ROUGE scores than models trained on just student reflection data or just newspaper data. The tuned model also achieved higher scores than extractive summarization baselines, and was additionally judged to produce more coherent and readable summaries in human evaluations. Second, we explored whether synthesizing summaries of student data could…
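The abstract reports results as ROUGE scores. As a rough illustration of what such a score measures, the sketch below computes a simplified ROUGE-1 (unigram overlap) between a candidate summary and a reference. It assumes lowercased whitespace tokenization with no stemming or stopword handling, so its numbers will not match the official ROUGE toolkit, and it is not the paper's evaluation code. The function name `rouge_1` and the example sentences are hypothetical.

```python
from collections import Counter


def rouge_1(candidate: str, reference: str) -> dict:
    """Simplified ROUGE-1: clipped unigram overlap between two texts.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()

    # Counter intersection keeps the minimum count of each shared token,
    # so a word repeated in the candidate is not over-credited.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    precision = overlap / len(cand_tokens) if cand_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Made-up example texts, not drawn from the paper's data.
    scores = rouge_1(
        "students found recursion confusing",
        "many students found the recursion examples confusing",
    )
    print(scores)
```

A higher recall means the candidate covers more of the reference's words; a higher precision means fewer of the candidate's words fall outside the reference.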

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques