MQAG: Multiple-choice Question Answering and Generation for Assessing   Information Consistency in Summarization

Potsawee Manakul; Adian Liusie; Mark J. F. Gales

arXiv:2301.12307·cs.CL·September 11, 2023

MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization

Potsawee Manakul, Adian Liusie, Mark J. F. Gales

PDF

2 Repos 4 Models

TL;DR

This paper introduces MQAG, a novel information-theoretic framework for assessing summary quality by comparing source and summary answer distributions over automatically generated questions, outperforming existing methods.

Contribution

MQAG is a new approach that uses statistical distance between answer distributions to evaluate information consistency in summarization.

Findings

01

MQAG outperforms existing evaluation methods on multiple datasets.

02

Models trained on SQuAD or RACE achieve high accuracy in assessing summaries.

03

The approach effectively detects factual inconsistencies in summaries.

Abstract

State-of-the-art summarization systems can generate highly fluent summaries. These summaries, however, may contain factual inconsistencies and/or information not present in the source. Hence, an important component of assessing the quality of summaries is to determine whether there is information consistency between the source and the summary. Existing approaches are typically based on lexical matching or representation-based methods. In this work, we introduce an alternative scheme based on standard information-theoretic measures in which the information present in the source and summary is directly compared. We propose a Multiple-choice Question Answering and Generation framework, MQAG, which approximates the information consistency by computing the expected statistical distance between summary and source answer distributions over automatically generated multiple-choice questions.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.