Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

Hyunseok Oh; Sam Stern; Youngki Lee; Matthai Philipose

arXiv:2602.18095·cs.AI·February 23, 2026

Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

Hyunseok Oh, Sam Stern, Youngki Lee, Matthai Philipose

PDF

Open Access 3 Reviews

TL;DR

This paper introduces Logitext, a neurosymbolic language that combines natural language constraints with SMT solving to improve reasoning in natural language understanding tasks, extending neurosymbolic methods beyond formal domains.

Contribution

It presents Logitext, a novel framework that integrates LLMs with SMT solvers for joint textual and logical reasoning on natural documents.

Findings

01

Improves accuracy and coverage on content moderation tasks.

02

Extends neurosymbolic reasoning to partial logical structures in natural language.

03

First approach to treat LLM reasoning as an SMT theory.

Abstract

Natural language understanding requires interleaving textual and logical reasoning, yet large language models often fail to perform such reasoning reliably. Existing neurosymbolic systems combine LLMs with solvers but remain limited to fully formalizable tasks such as math or program synthesis, leaving natural documents with only partial logical structure unaddressed. We introduce Logitext, a neurosymbolic language that represents documents as natural language text constraints (NLTCs), making partial logical structure explicit. We develop an algorithm that integrates LLM-based constraint evaluation with satisfiability modulo theory (SMT) solving, enabling joint textual-logical reasoning. Experiments on a new content moderation benchmark, together with LegalBench and Super-Natural Instructions, show that Logitext improves both accuracy and coverage. This work is the first that treats…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 6Confidence 4

Strengths

1. This paper proposes a neurosymbolic language, LogiText. LogiText does not require converting the entire document into strict logical formulas; instead, it allows for explicitly annotating only the key logical structures (e.g., Boolean relations) while retaining ambiguous textual clauses as natural language. This design bridges the gap between traditional symbolic solvers (which require fully formalizable domains) and real-world complex documents (which are essentially a mix of text and logic)

Weaknesses

1. The framework's reliance on precise clause-level annotation and evaluation is a critical vulnerability. LogiText relies on human experts to manually annotate natural language documents. This not only incurs high labor costs but also limits the method's scalability and application scope. Furthermore, the framework is fragile to "clause-level" errors. It still relies on the precise evaluation of each clause, and a failure in evaluating one clause can cause the entire logical chain to collapse.

Reviewer 02Rating 2Confidence 3

Strengths

1. The paper does address a real problem wrt LLMs struggling with logical consistency in policy documents. 2. The paper is clearly motivated through empirics on reasoning gaps. 3. To my knowledge, this mixture of SMT solvers with NL constraints is novel.

Weaknesses

1. The Logitext system seems quite contrived and requires significant manual effort to convert natural docs. 2. Manual annotation of the logical structure defeats the purpose of scalable automated reasoning. Therefore real usage of this is highly questionable. 3. The convergence of the NLSolver is not guaranteed and the caching employed seems quite ad-hoc. 4. Some of the proposed baselines appear weak and raise concerns about high quality evals.

Reviewer 03Rating 2Confidence 4

Strengths

1. The paper newly introduces a language called Logitext to fully represent the logical structure behind the natural language text. This is a very important problem in legal, political, and societal domain. 2. The performance is impressive. Logitext performs much better than few-shot prompting and the previous neurosymbolic approach on various benchmarks.

Weaknesses

The presentation should be further improved for clarifications. ### Major points - There are lots of words "solver", but I'm confused which solver do you exactly mean. I think you'd better come up with a clearer notation. There are lots of related words (e.g., solver, logical solver, SMT solver, LLM-based solver, symbolic solver), but for people who read this paper for the first time, it's really hard to get which "solver" do you mean in the paper throughout. - For Logitext, what if an implicit

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques