QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian   Reasoning Scenarios

Timo Pierre Schrader; Lukas Lange; Simon Razniewski; Annemarie; Friedrich

arXiv:2410.10449·cs.CL·October 15, 2024

QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios

Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie, Friedrich

PDF

Open Access 1 Repo 1 Video

TL;DR

QUITE is a new dataset for Bayesian reasoning in natural language, featuring complex relationships and real-world scenarios, enabling better evaluation of models' reasoning abilities.

Contribution

The paper introduces QUITE, a comprehensive dataset with natural language premises and questions for Bayesian reasoning, addressing limitations of previous datasets.

Findings

01

Logic-based models outperform large language models in reasoning tasks.

02

Neuro-symbolic models show promise for complex reasoning.

03

QUITE enables more realistic evaluation of reasoning models.

Abstract

Reasoning is key to many decision making processes. It requires consolidating a set of rule-like premises that are often associated with degrees of uncertainty and observations to draw conclusions. In this work, we address both the case where premises are specified as numeric probabilistic rules and situations in which humans state their estimates using words expressing degrees of certainty. Existing probabilistic reasoning datasets simplify the task, e.g., by requiring the model to only rank textual alternatives, by including only binary random variables, or by making use of a limited set of templates that result in less varied text. In this work, we present QUITE, a question answering dataset of real-world Bayesian reasoning scenarios with categorical random variables and complex relationships. QUITE provides high-quality natural language verbalizations of premises together with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

boschresearch/quite-emnlp24
pytorch

Videos

QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios· underline

Taxonomy

TopicsBayesian Modeling and Causal Inference · AI-based Problem Solving and Planning · Natural Language Processing Techniques

MethodsSparse Evolutionary Training