Simulated Human Learning in a Dynamic, Partially-Observed, Time-Series Environment
Jeffrey Jiang, Kevin Hong, Emily Kuczynski, Gregory Pottie

TL;DR
This paper develops a simulated classroom environment with student-teacher interactions to evaluate reinforcement learning-based intelligent tutoring systems, highlighting the importance of probing interventions for improved student modeling and adaptive instruction.
Contribution
It introduces a dynamic, partially-observed time-series environment for testing ITSs and compares RL algorithms with heuristics, emphasizing the role of probing interventions in student estimation.
Findings
Probing interventions improve student state estimation.
RL and heuristic policies perform similarly overall.
Probing benefits diminish with more hidden information.
Abstract
While intelligent tutoring systems (ITSs) can use information from past students to personalize instruction, each new student is unique. Moreover, the education problem is inherently difficult because the learning process is only partially observable. We therefore develop a dynamic, time-series environment to simulate a classroom setting, with student-teacher interventions - including tutoring sessions, lectures, and exams. In particular, we design the simulated environment to allow for varying levels of probing interventions that can gather more information. Then, we develop reinforcement learning ITSs that combine learning the individual state of students while pulling from population information through the use of probing interventions. These interventions can reduce the difficulty of student estimation, but also introduce a cost-benefit decision to find a balance between probing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Recommender Systems and Techniques
