Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition
Zhenghui Wang, Yanru Qu, Liheng Chen, Jian Shen, Weinan Zhang,, Shaodian Zhang, Yimei Gao, Gen Gu, Ken Chen, Yong Yu

TL;DR
This paper introduces La-DTL, a transfer learning framework that enables effective cross-specialty medical NER with minimal annotation, using label-aware feature and parameter transfer techniques, validated on 12 diverse tasks.
Contribution
The paper presents a novel label-aware double transfer learning method for cross-specialty medical NER, combining feature and parameter transfer with theoretical guarantees.
Findings
La-DTL improves NER accuracy across 12 medical specialties.
It demonstrates strong transferability to non-medical NER tasks.
The approach reduces annotation efforts for new medical specialties.
Abstract
We study the problem of named entity recognition (NER) from electronic medical records, which is one of the most fundamental and critical problems for medical text mining. Medical records which are written by clinicians from different specialties usually contain quite different terminologies and writing styles. The difference of specialties and the cost of human annotation makes it particularly difficult to train a universal medical NER system. In this paper, we propose a label-aware double transfer learning framework (La-DTL) for cross-specialty NER, so that a medical NER system designed for one specialty could be conveniently applied to another one with minimal annotation efforts. The transferability is guaranteed by two components: (i) we propose label-aware MMD for feature representation transfer, and (ii) we perform parameter transfer with a theoretical upper bound which is also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
