TL;DR
This paper introduces MQAG, a novel information-theoretic framework for assessing summary quality by comparing source and summary answer distributions over automatically generated questions, outperforming existing methods.
Contribution
MQAG is a new approach that uses statistical distance between answer distributions to evaluate information consistency in summarization.
Findings
MQAG outperforms existing evaluation methods on multiple datasets.
Models trained on SQuAD or RACE achieve high accuracy in assessing summaries.
The approach effectively detects factual inconsistencies in summaries.
Abstract
State-of-the-art summarization systems can generate highly fluent summaries. These summaries, however, may contain factual inconsistencies and/or information not present in the source. Hence, an important component of assessing the quality of summaries is to determine whether there is information consistency between the source and the summary. Existing approaches are typically based on lexical matching or representation-based methods. In this work, we introduce an alternative scheme based on standard information-theoretic measures in which the information present in the source and summary is directly compared. We propose a Multiple-choice Question Answering and Generation framework, MQAG, which approximates the information consistency by computing the expected statistical distance between summary and source answer distributions over automatically generated multiple-choice questions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗potsawee/t5-large-generation-race-QuestionAnswermodel· 182 dl· ♡ 17182 dl♡ 17
- 🤗potsawee/t5-large-generation-race-Distractormodel· 432 dl· ♡ 12432 dl♡ 12
- 🤗potsawee/t5-large-generation-squad-QuestionAnswermodel· 519 dl· ♡ 49519 dl♡ 49
- 🤗potsawee/longformer-large-4096-answering-racemodel· 295 dl· ♡ 16295 dl♡ 16
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
