Access Timing as Scaffolding: A Reinforcement Learning Approach to GenAI in Education
Janne Rotter, Pau Benazet i Montobbio, Davinia Hern\'andez-Leo

TL;DR
This study introduces a reinforcement learning-based method to optimize the timing of generative AI access in education, improving learning outcomes and engagement without explicit scaffolding.
Contribution
It presents a novel approach treating access timing as implicit scaffolding, operationalized through RL, grounded in educational theory, and validated via a controlled lab study.
Findings
Strategic access timing improved post-test performance.
Reduced task errors and time on task compared to full restriction.
No significant difference in self-reported metacognitive awareness.
Abstract
In recent years, generative AI (GenAI) in educational settings has become ubiquitous in students' daily lives, despite its potential to induce over-reliance, metacognitive disengagement, and diminished learning when used unrestrictedly. While most prior research has thus focused on how to pedagogically scaffold its usage, the question of when to allow off-the-shelf GenAI remains understudied and lacks pedagogically grounded empirical investigation. We treat access timing itself as a form of implicit scaffolding and operationalize it through a reinforcement learning (RL) agent that decides when students should access GenAI, with a reward function grounded in metacognitive theory, cognitive load theory, and productive failure. In a mixed-methods controlled lab study with N=105 participants, we compared the agent's effect on learning gains and metacognitive engagement to unrestricted and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
