Those Aren't Your Memories, They're Somebody Else's: Seeding Misinformation in Chat Bot Memories
Conor Atkins, Benjamin Zi Hao Zhao, Hassan Jameel Asghar, Ian Wood,, Mohamed Ali Kaafar

TL;DR
This paper reveals that chat bot long-term memories can be manipulated to store and recall misinformation, posing risks to factual accuracy and trustworthiness in conversational AI systems.
Contribution
The study demonstrates a novel vulnerability in chat bot memory mechanisms, showing how misinformation can be seeded and recalled as fact, with extensive empirical analysis on BlenderBot models.
Findings
76% of misinformation examples were remembered by BlenderBot 2
Misinformation recall increased by 328% when questioned on the topic
Large-scale analysis of 12,890 conversations confirms vulnerability
Abstract
One of the new developments in chit-chat bots is a long-term memory mechanism that remembers information from past conversations for increasing engagement and consistency of responses. The bot is designed to extract knowledge of personal nature from their conversation partner, e.g., stating preference for a particular color. In this paper, we show that this memory mechanism can result in unintended behavior. In particular, we found that one can combine a personal statement with an informative statement that would lead the bot to remember the informative statement alongside personal knowledge in its long term memory. This means that the bot can be tricked into remembering misinformation which it would regurgitate as statements of fact when recalling information relevant to the topic of conversation. We demonstrate this vulnerability on the BlenderBot 2 framework implemented on the ParlAI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Personal Information Management and User Behavior · Topic Modeling
