Building Effective Safety Guardrails in AI Education Tools
Hannah-Beth Clark, Laura Benton, Emma Searle, Margaux Dowland, Matthew Gregory, Will Gayne, John Roberts

TL;DR
This paper discusses the development and evaluation of safety guardrails in an AI-powered lesson planning tool for education, aiming to ensure content safety and appropriateness for students aged 5-16.
Contribution
It introduces a comprehensive safety framework with four guardrails for AI in education and shares insights on their implementation and ongoing refinement.
Findings
Effective safety guardrails require iterative development.
Human-in-the-loop enhances content safety.
Sharing open-source resources fosters collaboration.
Abstract
There has been rapid development in generative AI tools across the education sector, which in turn is leading to increased adoption by teachers. However, this raises concerns regarding the safety and age-appropriateness of the AI-generated content that is being created for use in classrooms. This paper explores Oak National Academy's approach to addressing these concerns within the development of the UK Government's first publicly available generative AI tool - our AI-powered lesson planning assistant (Aila). Aila is intended to support teachers planning national curriculum-aligned lessons that are appropriate for pupils aged 5-16 years. To mitigate safety risks associated with AI-generated content we have implemented four key safety guardrails - (1) prompt engineering to ensure AI outputs are generated within pedagogically sound and curriculum-aligned parameters, (2) input threat…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
