StagePilot: A Deep Reinforcement Learning Agent for Stage-Controlled Cybergrooming Simulation

Heajun An; Qi Zhang; Minqian Liu; Xinyi Zhang; Sang Won Lee; Lifu Huang; Pamela J. Wisniewski; Jin-Hee Cho

arXiv:2602.05060·cs.LG·February 6, 2026

StagePilot: A Deep Reinforcement Learning Agent for Stage-Controlled Cybergrooming Simulation

Heajun An, Qi Zhang, Minqian Liu, Xinyi Zhang, Sang Won Lee, Lifu Huang, Pamela J. Wisniewski, Jin-Hee Cho

PDF

Open Access

TL;DR

StagePilot is an offline reinforcement learning dialogue agent that simulates grooming behaviors for youth prevention training, balancing realism, emotional engagement, and strategic planning.

Contribution

It introduces a novel RL-based simulation framework for modeling grooming stages with constrained transitions and composite rewards.

Findings

01

IQL+AWAC achieves 43% higher stage completion than baselines.

02

Over 70% sentiment alignment in generated dialogues.

03

Realistic and coherent grooming simulations validated through LLM-based evaluation.

Abstract

Cybergrooming is an evolving threat to youth, necessitating proactive educational interventions. We propose StagePilot, an offline RL-based dialogue agent that simulates the stage-wise progression of grooming behaviors for prevention training. StagePilot selects conversational stages using a composite reward that balances user sentiment and goal proximity, with transitions constrained to adjacent stages for realism and interpretability. We evaluate StagePilot through LLM-based simulations, measuring stage completion, dialogue efficiency, and emotional engagement. Results show that StagePilot generates realistic and coherent conversations aligned with grooming dynamics. Among tested methods, the IQL+AWAC agent achieves the best balance between strategic planning and emotional coherence, reaching the final stage up to 43% more frequently than baselines while maintaining over 70% sentiment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Digital Mental Health Interventions · Intelligent Tutoring Systems and Adaptive Learning