MedAttacker: Exploring Black-Box Adversarial Attacks on Risk Prediction Models in Healthcare
Muchao Ye, Junyu Luo, Guanjie Zheng, Cao Xiao, Ting Wang, and Fenglong Ma

TL;DR
This paper introduces MedAttacker, a novel black-box adversarial attack method targeting health risk prediction models using EHR data, revealing their vulnerability and outperforming some white-box attacks in certain scenarios.
Contribution
MedAttacker is the first black-box attack method specifically designed for health risk prediction models with EHR data, combining reinforcement learning and score-based strategies.
Findings
MedAttacker achieves the highest success rate in black-box attacks.
It outperforms recent white-box attack techniques in some cases.
The study discusses potential defenses against EHR adversarial attacks.
Abstract
Deep neural networks (DNNs) have been broadly adopted in health risk prediction to provide healthcare diagnoses and treatments. To evaluate their robustness, existing research conducts adversarial attacks in the white/gray-box setting where model parameters are accessible. However, a more realistic black-box adversarial attack is ignored even though most real-world models are trained with private data and released as black-box services on the cloud. To fill this gap, we propose the first black-box adversarial attack method against health risk prediction models named MedAttacker to investigate their vulnerability. MedAttacker addresses the challenges brought by EHR data via two steps: hierarchical position selection which selects the attacked positions in a reinforcement learning (RL) framework and substitute selection which identifies substitute with a score-based principle.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
