When RAG Chatbots Expose Their Backend: An Anonymized Case Study of Privacy and Security Risks in Patient-Facing Medical AI
Alfredo Madrid-Garc\'ia, Miguel Rujas

TL;DR
This study reveals critical privacy and security vulnerabilities in a publicly accessible medical RAG chatbot, exposing sensitive data through client-side inspection, highlighting the need for rigorous independent review before deployment.
Contribution
It provides an anonymized case study demonstrating how standard browser tools can uncover security flaws in medical AI chatbots, emphasizing the importance of proper governance.
Findings
Sensitive configuration data was exposed via client-server communication.
Full chat histories, including health queries, were retrievable without authentication.
Browser inspection tools sufficed to identify serious privacy and security flaws.
Abstract
Background: Patient-facing medical chatbots based on retrieval-augmented generation (RAG) are increasingly promoted to deliver accessible, grounded health information. AI-assisted development lowers the barrier to building them, but they still demand rigorous security, privacy, and governance controls. Objective: To report an anonymized, non-destructive security assessment of a publicly accessible patient-facing medical RAG chatbot and identify governance lessons for safe deployment of generative AI in health. Methods: We used a two-stage strategy. First, Claude Opus 4.6 supported exploratory prompt-based testing and structured vulnerability hypotheses. Second, candidate findings were manually verified using Chrome Developer Tools, inspecting browser-visible network traffic, payloads, API schemas, configuration objects, and stored interaction data. Results: The LLM-assisted phase…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
