How Frontier LLMs Adapt to Neurodivergence Context: A Measurement Framework for Surface vs. Structural Change in System-Prompted Responses

Ishan Gupta; Pavlo Buryi

arXiv:2605.00113·cs.CL·May 4, 2026

How Frontier LLMs Adapt to Neurodivergence Context: A Measurement Framework for Surface vs. Structural Change in System-Prompted Responses

Ishan Gupta, Pavlo Buryi

PDF

1 Datasets

TL;DR

This paper introduces NDBench, a benchmark to measure how frontier LLMs adapt their responses to neurodivergence prompts, revealing structural changes and limitations in harm mitigation.

Contribution

The paper presents NDBench, a comprehensive framework for assessing LLM adaptation to neurodivergence contexts, including a detailed analysis of structural response changes and harm mitigation challenges.

Findings

01

LLMs produce longer, more structured responses under ND prompts.

02

Adaptation is mainly structural, with increased headings and detail.

03

Harm reduction via persona assertion alone is often ineffective.

Abstract

We examine if frontier chat-based large language models (LLMs) adjust their outputs based on neurodivergence (ND) context in system prompts and describe the nature of these adjustments. Specifically, we propose NDBench, a 576-output benchmark involving two frontier models, three system prompt types (baseline, ND-profile assertion, and ND-profile assertion with explicit instructions for adjustments), four canonical ND profiles, and 24 prompts across four categories, one of which involves an adversarial masking strategy. Four trends emerge consistently from our findings. First, LLMs show significant adaptation under ND context, where fully instructed conditions yield lengthier and more structured outputs, characterized by higher token counts, more headings, and more granular steps (p < 10^-8, Holm-corrected). Second, such adaptation is largely structural in nature: although list density…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

ishansgupta/ndbench
dataset· 43 dl
43 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.