Guiding ChatGPT to Generate Salient Domain Summaries
Jun Gao, Ziqiang Cao, Shaoyao Huang, Luozheng Qin, Chunhui Ai

TL;DR
This paper introduces PADS, a pipeline that enhances ChatGPT's domain-specific summarization by retrieving relevant examples and reranking candidate summaries, significantly improving salience and ROUGE scores across diverse datasets.
Contribution
PADS is a novel, lightweight framework combining retrieval and reranking to guide ChatGPT in generating more salient domain summaries in zero-shot settings.
Findings
PADS achieves over +8 ROUGE-L gain on Gigaword.
Effective demonstration retrieval improves summary salience.
Reranking with a small trainable model enhances summary quality.
Abstract
ChatGPT is instruct-tuned to generate general and human-expected content to align with human preference through Reinforcement Learning from Human Feedback (RLHF), meanwhile resulting in generated responses not salient enough. Therefore, in this case, ChatGPT may fail to satisfy domain requirements in zero-shot settings, leading to poor ROUGE scores. Inspired by the In-Context Learning (ICL) and retelling ability of ChatGPT, this paper proposes PADS, a \textbf{P}ipeline for \textbf{A}ssisting ChatGPT in \textbf{D}omain \textbf{S}ummarization. PADS consists of a retriever to retrieve similar examples from corpora and a rank model to rerank the multiple candidate summaries generated by ChatGPT. Specifically, given an inference document, we first retrieve an in-context demonstration via the retriever. Then, we require ChatGPT to generate candidate summaries for the inference document at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsALIGN
