# Snake Oil or Panacea? How to Misuse AI in Scientific Inquiries of the Human Mind

**Authors:** René Schlegelmilch, Lenard Dome

PMC · DOI: 10.3390/bs16020219 · Behavioral Sciences · 2026-02-03

## TL;DR

This paper warns that AI models can give misleading predictions about human behavior if they rely on accidental patterns in data.

## Contribution

The paper introduces a diagnostic tool to detect unreliable AI predictions in behavioral studies.

## Key findings

- LLMs often latch onto accidental patterns that fail in new experimental conditions.
- Standard validation methods do not catch these failures, leading to false confidence.
- AI predictions can reverse true relationships when applied beyond training data.

## Abstract

Large language models (LLMs) are increasingly used to predict human behavior from plain-text descriptions of experimental tasks that range from judging disease severity to consequential medical decisions. While these methods promise quick insights without complex psychological theories, we reveal a critical flaw: they often latch onto accidental patterns in the data that seem predictive but collapse when faced with novel experimental conditions. Testing across multiple behavioral studies, we show these models can generate wildly inaccurate predictions, sometimes even reversing true relationships, when applied beyond their training context. Standard validation techniques miss this flaw, creating false confidence in their reliability. We introduce a simple diagnostic tool to spot these failures and urge researchers to prioritize theoretical grounding over statistical convenience. Without this, LLM-driven behavioral predictions risk being scientifically meaningless, despite impressive initial results.

## Full-text entities

- **Diseases:** injury to (MESH:D014947), sleep deprivation (MESH:D012892), OOS (MESH:D000070591), OOD (MESH:D020243), cognitive performance (MESH:D003072)
- **Chemicals:** OOD (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12938490/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12938490/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/PMC12938490/full.md

---
Source: https://tomesphere.com/paper/PMC12938490