Human-in-the-loop Abstractive Dialogue Summarization
Jiaao Chen, Mohan Dodda, Diyi Yang

TL;DR
This paper introduces a human-in-the-loop reinforcement learning approach for abstractive dialogue summarization, leveraging human feedback at multiple levels to improve coherence, faithfulness, and overall quality of summaries.
Contribution
It proposes a novel training method that incorporates local and global human feedback via reinforcement learning to enhance dialogue summarization quality.
Findings
Outperforms state-of-the-art supervised models in human evaluations.
Effective in capturing salient information and improving summary coherence.
Demonstrates strong generalization across multiple datasets.
Abstract
Abstractive dialogue summarization has received increasing attention recently. Despite the fact that most of the current dialogue summarization systems are trained to maximize the likelihood of human-written summaries and have achieved significant results, there is still a huge gap in generating high-quality summaries as determined by humans, such as coherence and faithfulness, partly due to the misalignment in maximizing a single human-written summary. To this end, we propose to incorporate different levels of human feedback into the training process. This will enable us to guide the models to capture the behaviors humans care about for summaries. Specifically, we ask humans to highlight the salient information to be included in summaries to provide the local feedback , and to make overall comparisons among summaries in terms of coherence, accuracy, coverage, concise and overall…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
