K-12BERT: BERT for K-12 education

Vasu Goel; Dhruv Sahnan; Venktesh V; Gaurav Sharma; Deep Dwivedi,; Mukesh Mohania

arXiv:2205.12335·cs.CL·May 26, 2022

K-12BERT: BERT for K-12 education

Vasu Goel, Dhruv Sahnan, Venktesh V, Gaurav Sharma, Deep Dwivedi,, Mukesh Mohania

PDF

1 Repo 1 Models

TL;DR

K-12BERT is a domain-specific language model trained on K-12 educational data, tailored to improve NLP tasks in the education sector, especially across multiple subjects.

Contribution

This work introduces K-12BERT, the first pre-trained language model specifically adapted for K-12 education across various subjects.

Findings

01

K-12BERT outperforms general BERT on educational NLP tasks.

02

Effective in hierarchical taxonomy tagging for K-12 content.

03

Demonstrates the importance of domain-specific pre-training.

Abstract

Online education platforms are powered by various NLP pipelines, which utilize models like BERT to aid in content curation. Since the inception of the pre-trained language models like BERT, there have also been many efforts toward adapting these pre-trained models to specific domains. However, there has not been a model specifically adapted for the education domain (particularly K-12) across subjects to the best of our knowledge. In this work, we propose to train a language model on a corpus of data curated by us across multiple subjects from various sources for K-12 education. We also evaluate our model, K12-BERT, on downstream tasks like hierarchical taxonomy tagging.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ads-ai/k12-bert-aied-2022
noneOfficial

Models

🤗
vasugoel/K-12BERT
model· 13 dl· ♡ 10
13 dl♡ 10

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Weight Decay · Linear Warmup With Linear Decay · Dense Connections · Dropout · Adam · Attention Dropout · WordPiece