Promoting Online Safety by Simulating Unsafe Conversations with LLMs
Owen Hoffman, Kangze Peng, Zehua You, Sajid Kamal, Sukrit Venkatagiri

TL;DR
This paper investigates using simulated unsafe conversations generated by LLMs to educate users about online scams and promote safety, leveraging feedback mechanisms to enhance learning.
Contribution
It introduces a method where two LLMs simulate scam conversations, and users provide feedback to improve understanding of online safety risks.
Findings
LLMs can effectively simulate realistic scam conversations.
User feedback enhances learning about unsafe online interactions.
The approach promotes awareness and safety in online communication.
Abstract
Generative AI, including large language models (LLMs) have the potential -- and already are being used -- to increase the speed, scale, and types of unsafe conversations online. LLMs lower the barrier for entry for bad actors to create unsafe conversations in particular because of their ability to generate persuasive and human-like text. In our current work, we explore ways to promote online safety by teaching people about unsafe conversations that can occur online with and without LLMs. We build on prior work that shows that LLMs can successfully simulate scam conversations. We also leverage research in the learning sciences that shows that providing feedback on one's hypothetical actions can promote learning. In particular, we focus on simulating scam conversations using LLMs. Our work incorporates two LLMs that converse with each other to simulate realistic, unsafe conversations that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
