Text embedding models yield detailed conceptual knowledge maps derived from short multiple-choice quizzes
Paxton C. Fitzpatrick, Andrew C. Heusser, Jeremy R. Manning

TL;DR
This paper shows how text embedding models and short quizzes can create detailed maps of learners' conceptual knowledge and track how it changes over time.
Contribution
The novel use of text embeddings to track conceptual knowledge acquisition from quiz responses and video content.
Findings
A mathematical framework using text embeddings captures detailed conceptual knowledge.
Quiz responses combined with video transcripts reveal how learners' knowledge evolves.
The method predicts quiz success and identifies gaps in learners' understanding.
Abstract
Real-world conceptual knowledge is complex, multifaceted, and substantially over-simplified in most laboratory studies. Here we develop a mathematical framework, based on natural language processing models, for tracking and characterizing the acquisition of real-world conceptual knowledge. Our approach embeds each concept in a high-dimensional representation space where nearby coordinates reflect similar or related concepts. We test our approach using behavioral data from participants who answered small sets of multiple-choice quiz questions interleaved between watching two Khan Academy course videos. We apply our framework to the videos’ transcripts and the text of the quiz questions to quantify the content of each moment of video and each quiz question. We use these embeddings, along with participants’ quiz responses, to track how the learners’ knowledge changed after watching each…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsChild and Animal Learning Development · Multimodal Machine Learning Applications · Visual and Cognitive Learning Processes
