Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating   Models to Reflect Conflicting Evidence

Hung-Ting Chen; Michael J.Q. Zhang; Eunsol Choi

arXiv:2210.13701·cs.CL·October 26, 2022

Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence

Hung-Ting Chen, Michael J.Q. Zhang, Eunsol Choi

PDF

Open Access

TL;DR

This paper investigates how question answering models handle conflicting information from large knowledge sources, revealing reliance on non-parametric data and minimal confidence adjustment in contradictions, and proposes a calibration method to improve response consistency.

Contribution

The study highlights the impact of knowledge conflicts on model reliance and introduces a calibration approach to better manage conflicting evidence in question answering.

Findings

01

Models rely more on retrieved passages than parametric knowledge.

02

Contradictions among sources have little effect on model confidence.

03

Calibration discourages models from committing to a single answer amidst conflicts.

Abstract

Question answering models can use rich knowledge sources -- up to one hundred retrieved passages and parametric knowledge in the large-scale language model (LM). Prior work assumes information in such knowledge sources is consistent with each other, paying little attention to how models blend information stored in their LM parameters with that from retrieved evidence documents. In this paper, we simulate knowledge conflicts (i.e., where parametric knowledge suggests one answer and different passages suggest different answers) and examine model behaviors. We find retrieval performance heavily impacts which sources models rely on, and current models mostly rely on non-parametric knowledge in their best-performing settings. We discover a troubling trend that contradictions among knowledge sources affect model confidence only marginally. To address this issue, we present a new calibration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications