CogPic: A Multimodal Dataset for Early Cognitive Impairment Assessment via Picture Description Tasks

Liuyu Wu; Rui Feng; Jie Li; Wentao Xiang; Yi Zhang; Yin Cao; Siyang Song; Xiao Gu; Jianqing Li; and Wei Wang

arXiv:2604.01626·cs.DB·April 3, 2026

CogPic: A Multimodal Dataset for Early Cognitive Impairment Assessment via Picture Description Tasks

Liuyu Wu, Rui Feng, Jie Li, Wentao Xiang, Yi Zhang, Yin Cao, Siyang Song, Xiao Gu, Jianqing Li, and Wei Wang

PDF

1 Repo

TL;DR

CogPic introduces a large, multimodal dataset with synchronized audio, visual, and linguistic data from 574 participants for early cognitive impairment detection, enabling more robust machine learning models.

Contribution

This paper presents the CogPic dataset, the largest and most detailed multimodal benchmark for cognitive impairment assessment, with expert-validated ground truth and extensive data modalities.

Findings

01

Benchmark experiments demonstrate the dataset's potential for robust cognitive impairment detection.

02

CogPic provides a clinically validated, synchronized multimodal dataset for future research.

03

The dataset supports development of generalizable machine learning models for early dementia diagnosis.

Abstract

The automated evaluation of cognitive status utilizing multimedia technologies presents a promising frontier in early dementia diagnosis. However, the development of robust machine learning models for cognitive impairment detection is frequently hindered by the scarcity of large-scale, strictly synchronized, and clinically validated multimodal datasets. To bridge this critical gap, we introduce the CogPic database, a comprehensive multimodal benchmark meticulously designed for fine-grained cognitive impairment detection. The dataset comprises strictly synchronized audio, visual, and linguistic data continuously collected from 574 participants during a naturalistic picture description task. To establish highly reliable diagnostic ground truth, expert clinical neuropsychologists conducted exhaustive evaluations, stratifying participants into distinct cognitive groups through a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://cogpic.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.