Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Jia Tracy Shen; Michiharu Yamashita; Ethan Prihar; Neil Heffernan,; Xintao Wu; Sean McGrew; Dongwon Lee

arXiv:2105.11343·cs.CL·May 25, 2021

Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Jia Tracy Shen, Michiharu Yamashita, Ethan Prihar, Neil Heffernan,, Xintao Wu, Sean McGrew, Dongwon Lee

PDF

Open Access

TL;DR

This paper enhances automatic classification of educational content's knowledge components by expanding input types, increasing label granularity, and applying task-adaptive BERT, achieving higher accuracy and proposing a new evaluation measure.

Contribution

It introduces a novel approach using task-adaptive pre-trained BERT for multi-label classification of KCs with expanded inputs and labels, outperforming previous methods.

Findings

01

Prediction accuracy improved by 0.5-2.3% over baselines.

02

Expanded input types and labels increase practical applicability.

03

Proposed evaluation measure recovers 56-73% of mispredicted labels.

Abstract

Educational content labeled with proper knowledge components (KCs) are particularly useful to teachers or content organizers. However, manually labeling educational content is labor intensive and error-prone. To address this challenge, prior research proposed machine learning based solutions to auto-label educational content with limited success. In this work, we significantly improve prior research by (1) expanding the input types to include KC descriptions, instructional video titles, and problem descriptions (i.e., three types of prediction task), (2) doubling the granularity of the prediction from 198 to 385 KC labels (i.e., more practical setting but much harder multinomial classification problem), (3) improving the prediction accuracies by 0.5-2.3% using Task-adaptive Pre-trained BERT, outperforming six baselines, and (4) proposing a simple evaluation measure by which we can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsMulti-Head Attention · Linear Layer · Linear Warmup With Linear Decay · WordPiece · Layer Normalization · Attention Dropout · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Dropout · Attention Is All You Need