BadCLM: Backdoor Attack in Clinical Language Models for Electronic   Health Records

Weimin Lyu; Zexin Bi; Fusheng Wang; Chao Chen

arXiv:2407.05213·cs.CL·July 9, 2024·3 cites

BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records

Weimin Lyu, Zexin Bi, Fusheng Wang, Chao Chen

PDF

Open Access

TL;DR

This paper reveals vulnerabilities in clinical language models used in electronic health records by introducing BadCLM, a backdoor attack method that manipulates model outputs with specific triggers, highlighting security risks in clinical decision support.

Contribution

The paper presents BadCLM, an innovative attention-based backdoor attack technique specifically designed for clinical language models, demonstrating its effectiveness on real-world medical data.

Findings

01

BadCLM successfully manipulates model predictions with triggers

02

The attack compromises model integrity in clinical decision tasks

03

Security risks in clinical language models are significant and urgent

Abstract

The advent of clinical language models integrated into electronic health records (EHR) for clinical decision support has marked a significant advancement, leveraging the depth of clinical notes for improved decision-making. Despite their success, the potential vulnerabilities of these models remain largely unexplored. This paper delves into the realm of backdoor attacks on clinical language models, introducing an innovative attention-based backdoor attack method, BadCLM (Bad Clinical Language Models). This technique clandestinely embeds a backdoor within the models, causing them to produce incorrect predictions when a pre-defined trigger is present in inputs, while functioning accurately otherwise. We demonstrate the efficacy of BadCLM through an in-hospital mortality prediction task with MIMIC III dataset, showcasing its potential to compromise model integrity. Our findings illuminate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Topic Modeling · Artificial Intelligence in Healthcare and Education