MulCogBench: A Multi-modal Cognitive Benchmark Dataset for Evaluating   Chinese and English Computational Language Models

Yunhao Zhang; Xiaohan Zhang; Chong Li; Shaonan Wang; Chengqing Zong

arXiv:2403.01116·cs.CL·March 5, 2024·2 cites

MulCogBench: A Multi-modal Cognitive Benchmark Dataset for Evaluating Chinese and English Computational Language Models

Yunhao Zhang, Xiaohan Zhang, Chong Li, Shaonan Wang, Chengqing Zong

PDF

Open Access

TL;DR

This paper introduces MulCogBench, a multi-modal cognitive dataset for Chinese and English, and analyzes how language models relate to human brain data across modalities and languages.

Contribution

It presents a novel multi-modal cognitive benchmark dataset and demonstrates how language models align with human cognitive and neural data across different modalities and languages.

Findings

01

Language models share significant similarities with human cognitive data.

02

Context-aware models outperform context-independent models with increasing stimulus complexity.

03

Deeper layers of models align more with fMRI, while shallow layers align with MEG signals.

Abstract

Pre-trained computational language models have recently made remarkable progress in harnessing the language abilities which were considered unique to humans. Their success has raised interest in whether these models represent and process language like humans. To answer this question, this paper proposes MulCogBench, a multi-modal cognitive benchmark dataset collected from native Chinese and English participants. It encompasses a variety of cognitive data, including subjective semantic ratings, eye-tracking, functional magnetic resonance imaging (fMRI), and magnetoencephalography (MEG). To assess the relationship between language models and cognitive data, we conducted a similarity-encoding analysis which decodes cognitive data based on its pattern similarity with textual embeddings. Results show that language models share significant similarities with human cognitive data and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques