FedQAS: Privacy-aware machine reading comprehension with federated learning
Addi Ait-Mlouk, Sadi Alawadi, Salman Toor, Andreas Hellander

TL;DR
FedQAS introduces a federated learning-based system for machine reading comprehension that preserves privacy while enabling large-scale training on private datasets, utilizing transformer models and federated frameworks.
Contribution
It presents a novel privacy-aware MRC system combining federated learning and transformer models, allowing secure knowledge sharing without central data pooling.
Findings
Successfully deployed as a proof-of-concept with FEDn framework
Achieved competitive performance on SQuAD dataset
Demonstrated privacy preservation in federated MRC setting
Abstract
Machine reading comprehension (MRC) of text data is one important task in Natural Language Understanding. It is a complex NLP problem with a lot of ongoing research fueled by the release of the Stanford Question Answering Dataset (SQuAD) and Conversational Question Answering (CoQA). It is considered to be an effort to teach computers how to "understand" a text, and then to be able to answer questions about it using deep learning. However, until now large-scale training on private text data and knowledge sharing has been missing for this NLP task. Hence, we present FedQAS, a privacy-preserving machine reading system capable of leveraging large-scale private data without the need to pool those datasets in a central location. The proposed approach combines transformer models and federated learning technologies. The system is developed using the FEDn framework and deployed as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
