Towards Synthesizing Normative Data for Cognitive Assessments Using Generative Multimodal Large Language Models

Victoria Yan; Honor Chotkowski; Fengran Wang; Xinhui Li; Carl Yang; Jiaying Lu; Runze Yan; Xiao Hu; Alex Fedorov

arXiv:2508.17675·cs.LG·September 9, 2025

Towards Synthesizing Normative Data for Cognitive Assessments Using Generative Multimodal Large Language Models

Victoria Yan, Honor Chotkowski, Fengran Wang, Xinhui Li, Carl Yang, Jiaying Lu, Runze Yan, Xiao Hu, Alex Fedorov

PDF

TL;DR

This paper explores using advanced generative multimodal large language models to synthesize normative data for cognitive assessments, aiming to overcome traditional data collection challenges and facilitate the development of new image-based tests.

Contribution

It demonstrates the feasibility of using GPT-4o models with refined prompting strategies to generate synthetic normative responses for cognitive tests, capturing demographic and diagnostic variations.

Findings

01

Advanced prompts improve differentiation between diagnostic groups.

02

BERTScore is the most reliable metric for evaluating responses.

03

Synthetic responses exhibit higher realism and diversity.

Abstract

Cognitive assessments require normative data as essential benchmarks for evaluating individual performance. Hence, developing new cognitive tests based on novel image stimuli is challenging due to the lack of readily available normative data. Traditional data collection methods are costly, time-consuming, and infrequently updated, limiting their practical utility. Recent advancements in generative multimodal large language models (MLLMs) offer a new approach to generate synthetic normative data from existing cognitive test images. We investigated the feasibility of using MLLMs, specifically GPT-4o and GPT-4o-mini, to synthesize normative textual responses for established image-based cognitive assessments, such as the "Cookie Theft" picture description task. Two distinct prompting strategies-naive prompts with basic instructions and advanced prompts enriched with contextual guidance-were…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.