SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View

Yongjie Xiao; Hongru Liang; Peixin Qin; Yao Zhang; Wenqiang Lei

arXiv:2506.05000·cs.CL·September 8, 2025

SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View

Yongjie Xiao, Hongru Liang, Peixin Qin, Yao Zhang, Wenqiang Lei

PDF

Open Access

TL;DR

This paper introduces SCOP, a framework to evaluate large language models' comprehension processes from a cognitive perspective, revealing their limitations and similarities to human experts in understanding information.

Contribution

It provides a systematic method to assess LLMs' comprehension skills and offers insights into their alignment with human expert processes.

Findings

01

LLMs struggle to match expert-level comprehension.

02

LLMs perform better on local than global information.

03

LLMs can reach correct answers through flawed processes.

Abstract

Despite the great potential of large language models(LLMs) in machine comprehension, it is still disturbing to fully count on them in real-world scenarios. This is probably because there is no rational explanation for whether the comprehension process of LLMs is aligned with that of experts. In this paper, we propose SCOP to carefully examine how LLMs perform during the comprehension process from a cognitive view. Specifically, it is equipped with a systematical definition of five requisite skills during the comprehension process, a strict framework to construct testing data for these skills, and a detailed analysis of advanced open-sourced and closed-sourced LLMs using the testing data. With SCOP, we find that it is still challenging for LLMs to perform an expert-level comprehension process. Even so, we notice that LLMs share some similarities with experts, e.g., performing better at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Multimodal Machine Learning Applications

MethodsFocus