An In-Depth Evaluation of Federated Learning on Biomedical Natural   Language Processing

Le Peng; Gaoxiang Luo; sicheng zhou; jiandong chen; Rui Zhang; Ziyue; Xu; Ju Sun

arXiv:2307.11254·cs.CL·November 14, 2023

An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing

Le Peng, Gaoxiang Luo, sicheng zhou, jiandong chen, Rui Zhang, Ziyue, Xu, Ju Sun

PDF

Open Access 2 Repos

TL;DR

This paper evaluates federated learning's effectiveness on biomedical NLP tasks, demonstrating its advantages in privacy-preserving collaborative training, resilience, and inference speed compared to traditional and large language models.

Contribution

It provides a comprehensive evaluation of federated learning on biomedical NLP, highlighting its performance benefits and robustness across multiple tasks and models.

Findings

01

FL models outperform individual client models

02

FL models sometimes match polled data models

03

Pre-trained transformers show resilience in FL settings

Abstract

Language models (LMs) such as BERT and GPT have revolutionized natural language processing (NLP). However, the medical field faces challenges in training LMs due to limited data access and privacy constraints imposed by regulations like the Health Insurance Portability and Accountability Act (HIPPA) and the General Data Protection Regulation (GDPR). Federated learning (FL) offers a decentralized solution that enables collaborative learning while ensuring data privacy. In this study, we evaluated FL on 2 biomedical NLP tasks encompassing 8 corpora using 6 LMs. Our results show that: 1) FL models consistently outperformed models trained on individual clients' data and sometimes performed comparably with models trained with polled data; 2) with the fixed number of total data, FL models training with more clients produced inferior performance but pre-trained transformer-based models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Artificial Intelligence in Healthcare and Education

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Discriminative Fine-Tuning · Cosine Annealing · Softmax · Dense Connections · Weight Decay · Dropout