Malicious LLM-Based Conversational AI Makes Users Reveal Personal Information

Xiao Zhan; Juan Carlos Carrillo; William Seymour; Jose Such

arXiv:2506.11680·cs.CY·March 27, 2026

Malicious LLM-Based Conversational AI Makes Users Reveal Personal Information

Xiao Zhan, Juan Carlos Carrillo, William Seymour, Jose Such

PDF

Open Access

TL;DR

This study demonstrates that malicious LLM-based conversational AI can effectively extract personal information from users, highlighting significant privacy risks and the need for safeguards.

Contribution

The paper introduces a systematic investigation of malicious LLM-based CAIs designed to extract personal data, revealing their effectiveness and user perception impacts.

Findings

01

Malicious CAIs extract more personal information than benign ones.

02

Social strategies are most effective in encouraging disclosures.

03

Users perceive lower risks with certain malicious strategies.

Abstract

LLM-based Conversational AIs (CAIs), also known as GenAI chatbots, like ChatGPT, are increasingly used across various domains, but they pose privacy risks, as users may disclose personal information during their conversations with CAIs. Recent research has demonstrated that LLM-based CAIs could be used for malicious purposes. However, a novel and particularly concerning type of malicious LLM application remains unexplored: an LLM-based CAI that is deliberately designed to extract personal information from users. In this paper, we report on the malicious LLM-based CAIs that we created based on system prompts that used different strategies to encourage disclosures of personal information from users. We systematically investigate CAIs' ability to extract personal information from users during conversations by conducting a randomized-controlled trial with 502 participants. We assess the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics