Do LLMs Understand Ambiguity in Text? A Case Study in Open-world   Question Answering

Aryan Keluskar; Amrita Bhattacharjee; Huan Liu

arXiv:2411.12395·cs.CL·November 20, 2024·2 cites

Do LLMs Understand Ambiguity in Text? A Case Study in Open-world Question Answering

Aryan Keluskar, Amrita Bhattacharjee, Huan Liu

PDF

Open Access

TL;DR

This paper investigates how well Large Language Models understand ambiguity in open-domain question answering and demonstrates that simple, training-free disambiguation methods can improve their performance.

Contribution

The study introduces and evaluates token-level disambiguation strategies that enhance LLM performance on ambiguous questions without additional training.

Findings

01

Token-level disambiguation improves accuracy

02

Simple methods outperform complex ones in some cases

03

Explicit disambiguation reduces hallucinations and biases

Abstract

Ambiguity in natural language poses significant challenges to Large Language Models (LLMs) used for open-domain question answering. LLMs often struggle with the inherent uncertainties of human communication, leading to misinterpretations, miscommunications, hallucinations, and biased responses. This significantly weakens their ability to be used for tasks like fact-checking, question answering, feature extraction, and sentiment analysis. Using open-domain question answering as a test case, we compare off-the-shelf and few-shot LLM performance, focusing on measuring the impact of explicit disambiguation strategies. We demonstrate how simple, training-free, token-level disambiguation methods may be effectively used to improve LLM performance for ambiguous question answering tasks. We empirically show our findings and discuss best practices and broader impacts regarding ambiguity in LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques