A Puzzle-Based Dataset for Natural Language Inference

Roxana Szomiu; Adrian Groza

arXiv:2112.05742·cs.AI·October 28, 2025·1 cites

A Puzzle-Based Dataset for Natural Language Inference

Roxana Szomiu, Adrian Groza

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces a new dataset of logical puzzles in natural language, designed to evaluate natural language inference and understanding by providing annotated questions with verified answers.

Contribution

It presents a novel puzzle-based dataset with verified entailment, contradiction, and ambiguity labels, emphasizing puzzle quality for machine comprehension.

Findings

01

Dataset covers three domains: comparing puzzles, knights and knaves, zebra puzzles.

02

Questions are generated from relations and individuals in the puzzles.

03

Verified answers ensure reliability for natural language inference tasks.

Abstract

We provide here a dataset for tasks related to natural language understanding and natural language inference. The dataset contains logical puzzles in natural language from three domains: comparing puzzles, knighs and knaves, and zebra puzzles. Each puzzle is associated with the entire set of atomic questions that can be generated based on the relations and individuals occurring in the text. For each question we provide the correct answer: entailment, contradiction or ambiguity. The answer's correctness is verified against theorem provers. Good puzzles have two properties: (i) each piece of information is necessary and (ii) no unnecessary information is provided. These properties make puzzles interesting candidates for machine comprehension tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://bitbucket.org/RoxanaSz/puzzte
noneOfficial

Datasets

tasksource/puzzte
dataset· 25 dl
25 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications