Assessing LLMs for Zero-shot Abstractive Summarization Through the Lens   of Relevance Paraphrasing

Hadi Askari; Anshuman Chhabra; Muhao Chen; Prasant Mohapatra

arXiv:2406.03993·cs.CL·February 4, 2025·1 cites

Assessing LLMs for Zero-shot Abstractive Summarization Through the Lens of Relevance Paraphrasing

Hadi Askari, Anshuman Chhabra, Muhao Chen, Prasant Mohapatra

PDF

Open Access 1 Repo

TL;DR

This paper introduces relevance paraphrasing as a method to evaluate the robustness of large language models in zero-shot abstractive summarization by measuring their consistency across minimally perturbed inputs.

Contribution

The paper proposes a novel relevance paraphrasing technique to assess LLM robustness in zero-shot summarization, highlighting variability in model performance under input perturbations.

Findings

01

LLMs show inconsistent summarization results with minimally perturbed inputs.

02

Relevance paraphrasing reveals robustness gaps in current LLMs.

03

Performance varies across different datasets and model sizes.

Abstract

Large Language Models (LLMs) have achieved state-of-the-art performance at zero-shot generation of abstractive summaries for given articles. However, little is known about the robustness of such a process of zero-shot summarization. To bridge this gap, we propose relevance paraphrasing, a simple strategy that can be used to measure the robustness of LLMs as summarizers. The relevance paraphrasing approach identifies the most relevant sentences that contribute to generating an ideal summary, and then paraphrases these inputs to obtain a minimally perturbed dataset. Then, by evaluating model performance for summarization on both the original and perturbed datasets, we can assess the LLM's one aspect of robustness. We conduct extensive experiments with relevance paraphrasing on 4 diverse datasets, as well as 4 LLMs of different sizes (GPT-3.5-Turbo, Llama-2-13B, Mistral-7B, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HadiAskari/Relevance-Paraphrasing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Mathematics, Computing, and Information Processing