The NLP Sandbox: an efficient model-to-data system to enable federated and unbiased evaluation of clinical NLP models
Yao Yan, Thomas Yu, Kathleen Muenzen, Sijia Liu, Connor Boyle, George, Koslowski, Jiaxin Zheng, Nicholas Dobbins, Clement Essien, Hongfang Liu,, Larsson Omberg, Meliha Yestigen, Bradley Taylor, James A Eddy, Justin, Guinney, Sean Mooney, Thomas Schaffter

TL;DR
The NLP Sandbox enables federated, unbiased evaluation of clinical NLP models across multiple institutions without sharing sensitive data, using a standardized, model-to-data approach that enhances model assessment and collaboration.
Contribution
It introduces a federated evaluation system for clinical NLP models leveraging containerization and standardized schemas, allowing multi-site testing without data sharing.
Findings
Successful evaluation of de-identification models across three institutions
External validation demonstrated generalizability of models
The system facilitated user feedback and model integration
Abstract
Objective The evaluation of natural language processing (NLP) models for clinical text de-identification relies on the availability of clinical notes, which is often restricted due to privacy concerns. The NLP Sandbox is an approach for alleviating the lack of data and evaluation frameworks for NLP models by adopting a federated, model-to-data approach. This enables unbiased federated model evaluation without the need for sharing sensitive data from multiple institutions. Materials and Methods We leveraged the Synapse collaborative framework, containerization software, and OpenAPI generator to build the NLP Sandbox (nlpsandbox.io). We evaluated two state-of-the-art NLP de-identification focused annotation models, Philter and NeuroNER, using data from three institutions. We further validated model performance using data from an external validation site. Results We demonstrated the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectronic Health Records Systems · Biomedical Text Mining and Ontologies · Topic Modeling
