How Frontier LLMs Adapt to Neurodivergence Context: A Measurement Framework for Surface vs. Structural Change in System-Prompted Responses
Ishan Gupta, Pavlo Buryi

TL;DR
This paper introduces NDBench, a benchmark to measure how frontier LLMs adapt their responses to neurodivergence prompts, revealing structural changes and limitations in harm mitigation.
Contribution
The paper presents NDBench, a comprehensive framework for assessing LLM adaptation to neurodivergence contexts, including a detailed analysis of structural response changes and harm mitigation challenges.
Findings
LLMs produce longer, more structured responses under ND prompts.
Adaptation is mainly structural, with increased headings and detail.
Harm reduction via persona assertion alone is often ineffective.
Abstract
We examine if frontier chat-based large language models (LLMs) adjust their outputs based on neurodivergence (ND) context in system prompts and describe the nature of these adjustments. Specifically, we propose NDBench, a 576-output benchmark involving two frontier models, three system prompt types (baseline, ND-profile assertion, and ND-profile assertion with explicit instructions for adjustments), four canonical ND profiles, and 24 prompts across four categories, one of which involves an adversarial masking strategy. Four trends emerge consistently from our findings. First, LLMs show significant adaptation under ND context, where fully instructed conditions yield lengthier and more structured outputs, characterized by higher token counts, more headings, and more granular steps (p < 10^-8, Holm-corrected). Second, such adaptation is largely structural in nature: although list density…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
