Evaluating the Effectiveness of OpenAI's Parental Control System
Kerem Ersoz, Saleh Afroogh, David Atkinson, Junfeng Jiao

TL;DR
This study assesses how well OpenAI's platform-level parental controls moderate a conversational assistant for minors, revealing strengths and gaps in alerting and overblocking, and proposing improvements for better safety and transparency.
Contribution
It introduces a two-phase evaluation protocol for parental controls, compares current and legacy models, and offers actionable recommendations to enhance moderation effectiveness and transparency.
Findings
Notifications are selective, missing alerts for some risks.
Current backend has lower leak-through than legacy models.
Overblocking of benign queries remains a challenge.
Abstract
We evaluate how effectively platform-level parental controls moderate a mainstream conversational assistant used by minors. Our two-phase protocol first builds a category-balanced conversation corpus via PAIR-style iterative prompt refinement over API, then has trained human agents replay/refine those prompts in the consumer UI using a designated child account while monitoring the linked parent inbox for alerts. We focus on seven risk areas -- physical harm, pornography, privacy violence, health consultation, fraud, hate speech, and malware and quantify four outcomes: Notification Rate (NR), Leak-Through (LR), Overblocking (OBR), and UI Intervention Rate (UIR). Using an automated judge (with targeted human audit) and comparing the current backend to legacy variants (GPT-4.1/4o), we find that notifications are selective rather than comprehensive: privacy violence, fraud, hate speech, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Digital Mental Health Interventions · Child Development and Digital Technology
