From Sparse to Dense: GPT-4 Summarization with Chain of Density   Prompting

Griffin Adams; Alexander Fabbri; Faisal Ladhak; Eric Lehman; No\'emie; Elhadad

arXiv:2309.04269·cs.CL·September 11, 2023·6 cites

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Griffin Adams, Alexander Fabbri, Faisal Ladhak, Eric Lehman, No\'emie, Elhadad

PDF

Open Access

TL;DR

This paper introduces a 'Chain of Density' prompting method for GPT-4 that iteratively creates summaries with optimal density, balancing informativeness and readability, and demonstrates its effectiveness through human preference studies.

Contribution

It proposes a novel iterative prompting technique, Chain of Density, to generate more informative and abstractive summaries without increasing length, improving over vanilla GPT-4 prompts.

Findings

01

GPT-4 with CoD produces more abstractive summaries.

02

Humans prefer denser GPT-4 summaries over vanilla prompts.

03

CoD summaries are nearly as dense as human-written summaries.

Abstract

Selecting the ``right'' amount of information to include in a summary is a difficult task. A good summary should be detailed and entity-centric without being overly dense and hard to follow. To better understand this tradeoff, we solicit increasingly dense GPT-4 summaries with what we refer to as a ``Chain of Density'' (CoD) prompt. Specifically, GPT-4 generates an initial entity-sparse summary before iteratively incorporating missing salient entities without increasing the length. Summaries generated by CoD are more abstractive, exhibit more fusion, and have less of a lead bias than GPT-4 summaries generated by a vanilla prompt. We conduct a human preference study on 100 CNN DailyMail articles and find that that humans prefer GPT-4 summaries that are more dense than those generated by a vanilla prompt and almost as dense as human written summaries. Qualitative analysis supports the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Absolute Position Encodings