Semantic Deception: When Reasoning Models Can't Compute an Addition

Nathani\"el de Leeuw; Marceau Nahon; Mathis Reymond; Raja Chatila; Mehdi Khamassi

arXiv:2512.20812·cs.CL·December 25, 2025

Semantic Deception: When Reasoning Models Can't Compute an Addition

Nathani\"el de Leeuw, Marceau Nahon, Mathis Reymond, Raja Chatila, Mehdi Khamassi

PDF

Open Access

TL;DR

This paper investigates the reasoning limitations of large language models when processing novel symbolic representations, revealing their tendency to rely on semantic cues over true symbolic manipulation, which raises concerns for decision-making applications.

Contribution

The study introduces an experimental framework for testing LLMs' ability to handle unfamiliar symbols and resist misleading semantic cues, highlighting their current shortcomings in symbolic reasoning.

Findings

01

Semantic cues significantly impair LLMs' reasoning performance.

02

LLMs tend to over-rely on surface semantics rather than symbolic logic.

03

Chain-of-thought prompting may increase reliance on statistical correlations.

Abstract

Large language models (LLMs) are increasingly used in situations where human values are at stake, such as decision-making tasks that involve reasoning when performed by humans. We investigate the so-called reasoning capabilities of LLMs over novel symbolic representations by introducing an experimental framework that tests their ability to process and manipulate unfamiliar symbols. We introduce semantic deceptions: situations in which symbols carry misleading semantic associations due to their form, such as being embedded in specific contexts, designed to probe whether LLMs can maintain symbolic abstraction or whether they default to exploiting learned semantic associations. We redefine standard digits and mathematical operators using novel symbols, and task LLMs with solving simple calculations expressed in this altered notation. The objective is: (1) to assess LLMs' capacity for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Ethics and Social Impacts of AI