Transfer Learning for Named-Entity Recognition with Neural Networks

Ji Young Lee; Franck Dernoncourt; Peter Szolovits

arXiv:1705.06273·cs.CL·May 18, 2017·83 cites

Transfer Learning for Named-Entity Recognition with Neural Networks

Ji Young Lee, Franck Dernoncourt, Peter Szolovits

PDF

Open Access

TL;DR

This paper explores how transfer learning can improve neural network-based named-entity recognition, especially in scenarios with limited labeled data, by transferring knowledge from large datasets to specialized tasks like patient note de-identification.

Contribution

The study demonstrates that transfer learning significantly enhances NER performance on small datasets, particularly for sensitive applications like medical record de-identification.

Findings

01

Transfer learning improves NER accuracy with limited labels.

02

Transfer from large datasets outperforms training from scratch.

03

Effective for patient note de-identification.

Abstract

Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-entity recognition (NER). In order to achieve high performances, ANNs need to be trained on a large labeled dataset. However, labels might be difficult to obtain for the dataset on which the user wants to perform NER: label scarcity is particularly pronounced for patient note de-identification, which is an instance of NER. In this work, we analyze to what extent transfer learning may address this issue. In particular, we demonstrate that transferring an ANN model trained on a large labeled dataset to another dataset with a limited number of labels improves upon the state-of-the-art results on two different datasets for patient note de-identification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Topic Modeling · Electronic Health Records Systems