Can LLMs Estimate Cognitive Complexity of Reading Comprehension Items?

Seonjeong Hwang; Hyounghun Kim; Gary Geunbae Lee

arXiv:2510.25064·cs.CL·May 20, 2026

Can LLMs Estimate Cognitive Complexity of Reading Comprehension Items?

Seonjeong Hwang, Hyounghun Kim, Gary Geunbae Lee

PDF

TL;DR

This paper investigates whether large language models can estimate the cognitive complexity of reading comprehension items, focusing on evidence scope and transformation level, to aid in pre-assessment difficulty analysis.

Contribution

It demonstrates that LLMs can approximate cognitive complexity of RC items, highlighting their potential for automated prior difficulty estimation.

Findings

01

LLMs can estimate cognitive complexity dimensions of RC items.

02

A gap exists between LLM reasoning ability and metacognitive awareness.

03

LLMs show promise as tools for pre-assessment difficulty analysis.

Abstract

Estimating the cognitive complexity of reading comprehension (RC) items is crucial for assessing item difficulty before it is administered to learners. Unlike syntactic and semantic features, such as passage length or semantic similarity between options, cognitive features that arise during answer reasoning are not readily extractable using existing NLP tools and have traditionally relied on human annotation. In this study, we examine whether large language models (LLMs) can estimate the cognitive complexity of RC items by focusing on two dimensions-Evidence Scope and Transformation Level-that indicate the degree of cognitive burden involved in reasoning about the answer. Our experimental results demonstrate that LLMs can approximate the cognitive complexity of items, indicating their potential as tools for prior difficulty analysis. Further analysis reveals a gap between LLMs'…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.