A Transfer Learning Pipeline for Educational Resource Discovery with Application in Leading Paragraph Generation
Irene Li, Thomas George, Alexander Fabbri, Tammy Liao, Benjamin Chen,, Rina Kawamura, Richard Zhou, Vanessa Yan, Swapnil Hingmire, Dragomir Radev

TL;DR
This paper introduces a transfer learning pipeline for discovering educational resources across new domains, utilizing a novel neural network model for feature extraction, and demonstrates its application in generating leading paragraphs for surveys.
Contribution
It presents a new ERD pipeline with a pretrained neural network model for feature extraction and resource classification, and applies it to enhance survey paragraph generation.
Findings
Achieved F1 scores of 0.94 and 0.82 on target domains.
Developed a novel query-document masked language model (QD-MLM).
Released a large corpus of web resources and queries.
Abstract
Effective human learning depends on a wide selection of educational materials that align with the learner's current understanding of the topic. While the Internet has revolutionized human learning or education, a substantial resource accessibility barrier still exists. Namely, the excess of online information can make it challenging to navigate and discover high-quality learning materials. In this paper, we propose the educational resource discovery (ERD) pipeline that automates web resource discovery for novel domains. The pipeline consists of three main steps: data collection, feature extraction, and resource classification. We start with a known source domain and conduct resource discovery on two unseen target domains via transfer learning. We first collect frequent queries from a set of seed documents and search on the web to obtain candidate resources, such as lecture slides and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Topic Modeling · Text and Document Classification Technologies
