Explaining Puzzle Solutions in Natural Language: An Exploratory Study on 6x6 Sudoku

Anirudh Maiya; Razan Alghamdi; Maria Leonor Pacheco; Ashutosh Trivedi; Fabio Somenzi

arXiv:2505.15993·cs.CL·April 28, 2026

Explaining Puzzle Solutions in Natural Language: An Exploratory Study on 6x6 Sudoku

Anirudh Maiya, Razan Alghamdi, Maria Leonor Pacheco, Ashutosh Trivedi, Fabio Somenzi

PDF

TL;DR

This study evaluates five Large Language Models on their ability to solve and explain 6x6 Sudoku puzzles, revealing significant challenges in generating strategic and intuitive explanations for collaborative decision-making.

Contribution

It provides an exploratory analysis of LLMs' capabilities in solving and explaining complex puzzles, highlighting gaps in their reasoning and explanation quality.

Findings

01

One LLM shows limited puzzle-solving success.

02

None of the LLMs can produce explanations reflecting strategic reasoning.

03

Results highlight challenges in LLMs' ability to generate trustworthy, tailored explanations.

Abstract

The success of Large Language Models (LLMs) in human-AI collaborative decision-making hinges on their ability to provide trustworthy, gradual, and tailored explanations. Solving complex puzzles, such as Sudoku, offers a canonical example of this collaboration, where clear and customized explanations often hold greater importance than the final solution. In this study, we evaluate the performance of five LLMs in solving and explaining \sixsix{} Sudoku puzzles. While one LLM demonstrates limited success in solving puzzles, none can explain the solution process in a manner that reflects strategic reasoning or intuitive problem-solving. These findings underscore significant challenges that must be addressed before LLMs can become effective partners in human-AI collaborative decision-making.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.