Transfer Learning for Named-Entity Recognition with Neural Networks
Ji Young Lee, Franck Dernoncourt, Peter Szolovits

TL;DR
This paper explores how transfer learning can improve neural network-based named-entity recognition, especially in scenarios with limited labeled data, by transferring knowledge from large datasets to specialized tasks like patient note de-identification.
Contribution
The study demonstrates that transfer learning significantly enhances NER performance on small datasets, particularly for sensitive applications like medical record de-identification.
Findings
Transfer learning improves NER accuracy with limited labels.
Transfer from large datasets outperforms training from scratch.
Effective for patient note de-identification.
Abstract
Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-entity recognition (NER). In order to achieve high performances, ANNs need to be trained on a large labeled dataset. However, labels might be difficult to obtain for the dataset on which the user wants to perform NER: label scarcity is particularly pronounced for patient note de-identification, which is an instance of NER. In this work, we analyze to what extent transfer learning may address this issue. In particular, we demonstrate that transferring an ANN model trained on a large labeled dataset to another dataset with a limited number of labels improves upon the state-of-the-art results on two different datasets for patient note de-identification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Electronic Health Records Systems
