Evaluating 21st-Century Competencies in Postsecondary Curricula with Large Language Models: Performance Benchmarking and Reasoning-Based Prompting Strategies
Zhen Xu, Xin Guan, Chenxi Shi, Qinhao Chen, Renzhe Yu

TL;DR
This study benchmarks large language models for analyzing postsecondary curricula to evaluate 21st-century competencies, introducing a reasoning-based prompting strategy that improves pedagogical inference but still falls short of human precision.
Contribution
It extends curricular analytics by benchmarking LLMs, analyzing curriculum sources, and proposing Curricular CoT prompting to enhance pedagogical reasoning in competency assessment.
Findings
Detailed instructional descriptions are most informative.
Open-weight LLMs perform comparably to proprietary models on coarse tasks.
Curricular CoT modestly improves reasoning and bias reduction.
Abstract
The growing emphasis on 21st-century competencies in postsecondary education, intensified by the transformative impact of generative AI, underscores the need to evaluate how these competencies are embedded in curricula and how effectively academic programs align with evolving workforce and societal demands. Curricular Analytics, particularly recent generative AI-powered approaches, offer a promising data-driven pathway. However, analyzing 21st-century competencies requires pedagogical reasoning beyond surface-level information retrieval, and the capabilities of large language models in this context remain underexplored. In this study, we extend prior curricular analytics research by examining a broader range of curriculum documents, competency frameworks, and models. Using 7,600 manually annotated curriculum-competency alignment scores, we assess the informativeness of different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Educational Assessment and Pedagogy · Intelligent Tutoring Systems and Adaptive Learning
