Unveil Multi-Picture Descriptions for Multilingual Mild Cognitive Impairment Detection via Contrastive Learning

Kristin Qi; Jiali Cheng; Youxiang Zhu; Hadi Amiri; Xiaohui Liang

arXiv:2505.17067·cs.CL·September 4, 2025

Unveil Multi-Picture Descriptions for Multilingual Mild Cognitive Impairment Detection via Contrastive Learning

Kristin Qi, Jiali Cheng, Youxiang Zhu, Hadi Amiri, Xiaohui Liang

PDF

TL;DR

This paper introduces a novel framework combining contrastive learning, image modality, and a Product of Experts strategy to improve multilingual multi-picture MCI detection, achieving significant performance gains.

Contribution

It presents a new multimodal approach with contrastive learning and PoE to enhance MCI detection across multiple languages and pictures, addressing prior limitations.

Findings

01

7.1% increase in UAR over baseline

02

2.9% increase in F1 score

03

Contrastive learning benefits text modality more

Abstract

Detecting Mild Cognitive Impairment from picture descriptions is critical yet challenging, especially in multilingual and multiple picture settings. Prior work has primarily focused on English speakers describing a single picture (e.g., the 'Cookie Theft'). The TAUKDIAL-2024 challenge expands this scope by introducing multilingual speakers and multiple pictures, which presents new challenges in analyzing picture-dependent content. To address these challenges, we propose a framework with three components: (1) enhancing discriminative representation learning via supervised contrastive learning, (2) involving image modality rather than relying solely on speech and text modalities, and (3) applying a Product of Experts (PoE) strategy to mitigate spurious correlations and overfitting. Our framework improves MCI detection performance, achieving a +7.1% increase in Unweighted Average Recall…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Learning