Generating A Crowdsourced Conversation Dataset to Combat Cybergrooming
Xinyi Zhang, Pamela J. Wisniewski, Jin-hee Cho, Lifu Huang, Sang Won, Lee

TL;DR
This paper advocates for creating authentic conversational datasets through surveys to train AI agents that educate adolescents about cybergrooming, aiming to improve prevention and awareness.
Contribution
It introduces a novel approach to develop large-scale datasets via surveys to enhance AI-based educational tools against cybergrooming.
Findings
Initial survey design considerations for authentic data collection
Open questions on dataset development and ethical considerations
Potential for improved AI training with new datasets
Abstract
Cybergrooming emerges as a growing threat to adolescent safety and mental health. One way to combat cybergrooming is to leverage predictive artificial intelligence (AI) to detect predatory behaviors in social media. However, these methods can encounter challenges like false positives and negative implications such as privacy concerns. Another complementary strategy involves using generative artificial intelligence to empower adolescents by educating them about predatory behaviors. To this end, we envision developing state-of-the-art conversational agents to simulate the conversations between adolescents and predators for educational purposes. Yet, one key challenge is the lack of a dataset to train such conversational agents. In this position paper, we present our motivation for empowering adolescents to cope with cybergrooming. We propose to develop large-scale, authentic datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImpact of AI and Big Data on Business and Society · Virtual Reality Applications and Impacts
