Improving the robustness and accuracy of biomedical language models through adversarial training
Milad Moradi, Matthias Samwald

TL;DR
This paper enhances biomedical transformer NLP models by applying adversarial training, significantly improving their robustness against adversarial attacks and also boosting their accuracy on clean data.
Contribution
It introduces adversarial training for biomedical NLP models, demonstrating improved robustness and accuracy, addressing security concerns in real-world applications.
Findings
Adversarial samples cause performance drops of 21% and 18.9%.
Adversarial training improves robustness by 11.3%.
Model accuracy on clean data increases by 2.4%.
Abstract
Deep transformer neural network models have improved the predictive accuracy of intelligent text processing systems in the biomedical domain. They have obtained state-of-the-art performance scores on a wide variety of biomedical and clinical Natural Language Processing (NLP) benchmarks. However, the robustness and reliability of these models has been less explored so far. Neural NLP models can be easily fooled by adversarial samples, i.e. minor changes to input that preserve the meaning and understandability of the text but force the NLP system to make erroneous decisions. This raises serious concerns about the security and trust-worthiness of biomedical NLP systems, especially when they are intended to be deployed in real-world use cases. We investigated the robustness of several transformer neural language models, i.e. BioBERT, SciBERT, BioMed-RoBERTa, and Bio-ClinicalBERT, on a wide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education
