AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions
Alejandro R Jadad

TL;DR
This paper identifies a failure pattern called helicoid dynamics in large language models, showing they can recognize errors but often fail to correct them under high-stakes, uncertain scenarios, impacting trustworthiness.
Contribution
It introduces the concept of helicoid dynamics in LLMs, demonstrating its presence across multiple systems and scenarios, and discusses implications for AI oversight and collaboration.
Findings
All tested models exhibited helicoid dynamics in high-stakes scenarios.
Models attributed persistent errors to structural training factors.
Understanding helicoid dynamics is key to improving trustworthy AI systems.
Abstract
Large language models perform reliably when their outputs can be checked: solving equations, writing code, retrieving facts. They perform differently when checking is impossible, as when a clinician chooses an irreversible treatment on incomplete data, or an investor commits capital under fundamental uncertainty. Helicoid dynamics is the name given to a specific failure regime in that second domain: a system engages competently, drifts into error, accurately names what went wrong, then reproduces the same pattern at a higher level of sophistication, recognizing it is looping and continuing nonetheless. This prospective case series documents that regime across seven leading systems (Claude, ChatGPT, Gemini, Grok, DeepSeek, Perplexity, Llama families), tested across clinical diagnosis, investment evaluation, and high-consequence interview scenarios. Despite explicit protocols designed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Clinical Reasoning and Diagnostic Skills · Ethics and Social Impacts of AI
