Fast Analysis of the OpenAI O1-Preview Model in Solving Random K-SAT   Problem: Does the LLM Solve the Problem Itself or Call an External SAT   Solver?

Raffaele Marino

arXiv:2409.11232·cs.CL·September 23, 2024

Fast Analysis of the OpenAI O1-Preview Model in Solving Random K-SAT Problem: Does the LLM Solve the Problem Itself or Call an External SAT Solver?

Raffaele Marino

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether the OpenAI O1-preview model directly solves random K-SAT problems or relies on external solvers, revealing it often calls external solvers and outputs incorrect solutions, raising questions about its problem-solving capabilities.

Contribution

The study provides a detailed analysis of the model's behavior in solving K-SAT problems, highlighting its dependence on external solvers and assessing its apparent intelligence.

Findings

01

Model calls external SAT solvers to find solutions.

02

Model often outputs incorrect assignments.

03

Analysis suggests the model does not truly solve K-SAT independently.

Abstract

In this manuscript, I present an analysis on the performance of OpenAI O1-preview model in solving random K-SAT instances for K $\in 2, 3, 4$ as a function of $α = M / N$ where $M$ is the number of clauses and $N$ is the number of variables of the satisfiable problem. I show that the model can call an external SAT solver to solve the instances, rather than solving them directly. Despite using external solvers, the model reports incorrect assignments as output. Moreover, I propose and present an analysis to quantify whether the OpenAI O1-preview model demonstrates a spark of intelligence or merely makes random guesses when outputting an assignment for a Boolean satisfiability problem.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

raffaelemarino/analysisopenaio1modelksat
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLaw, logistics, and international trade · Constraint Satisfaction and Optimization · Auction Theory and Applications