Probing of Quantitative Values in Abstractive Summarization Models

Nathan M. White

arXiv:2210.00667·cs.CL·October 4, 2022

Probing of Quantitative Values in Abstractive Summarization Models

Nathan M. White

PDF

Open Access

TL;DR

This paper investigates how well current abstractive summarization models understand and represent quantitative data, revealing that most models struggle with accurate encoding, which may contribute to data hallucination issues.

Contribution

It introduces probing tests to evaluate quantitative value representation in summarization models and analyzes the impact of encoder performance on hallucination problems.

Findings

01

Most models' encoders poorly represent quantitative values.

02

DistilBART-CDM underperforms compared to random baselines.

03

Pretraining and fine-tuning may influence encoder effectiveness.

Abstract

Abstractive text summarization has recently become a popular approach, but data hallucination remains a serious problem, including with quantitative data. We propose a set of probing tests to evaluate the efficacy of abstract summarization models' modeling of quantitative values found in the input text. Our results show that in most cases, the encoders of recent SOTA-performing models struggle to provide embeddings that adequately represent quantitative values in the input compared to baselines, and in particular, they outperform random representations in some, but surprisingly not all, cases. Under our assumptions, this suggests that the encoder's performance contributes to the quantity hallucination problem. One model type in particular, DistilBART-CDM, was observed to underperform randomly initialized representations for several experiments, and performance versus BERT suggests that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Weight Decay · Attention Dropout · WordPiece · Adam · Dense Connections · Linear Warmup With Linear Decay · Dropout