Course-Skill Atlas: A national longitudinal dataset of skills taught in U.S. higher education curricula
Alireza Javadian Sabet, Sarah H. Bana, Renzhe Yu, Morgan R., Frank

TL;DR
Course-Skill Atlas is a comprehensive, longitudinal dataset that maps skills taught in U.S. higher education curricula to detailed workplace activities, enabling analysis of education's role in workforce preparation.
Contribution
This work introduces a novel dataset linking course syllabi to workplace skills using NLP, filling a gap in understanding higher education's contribution to skill development.
Findings
Dataset covers over 3 million courses from nearly 3,000 institutions.
Aligns course content with detailed workplace activities using NLP.
Provides insights into skills development trends in higher education.
Abstract
Higher education plays a critical role in driving an innovative economy by equipping students with knowledge and skills demanded by the workforce. While researchers and practitioners have developed data systems to track detailed occupational skills, such as those established by the U.S. Department of Labor (DOL), much less effort has been made to document which of these skills are being developed in higher education at a similar granularity. Here, we fill this gap by presenting Course-Skill Atlas -- a longitudinal dataset of skills inferred from over three million course syllabi taught at nearly three thousand U.S. higher education institutions. To construct Course-Skill Atlas, we apply natural language processing to quantify the alignment between course syllabi and detailed workplace activities (DWAs) used by the DOL to describe occupations. We then aggregate these alignment scores to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHigher Education Learning Practices · Online Learning and Analytics
