De-identification of Patient Notes with Recurrent Neural Networks

Franck Dernoncourt; Ji Young Lee; Ozlem Uzuner; Peter Szolovits

arXiv:1606.03475·cs.CL·June 14, 2016

TL;DR

This paper presents a neural network-based system for automatically de-identifying patient notes in electronic health records, achieving state-of-the-art performance without manual feature engineering.

Contribution

It introduces the first ANN-based de-identification system that outperforms existing methods and does not require handcrafted features or rules.

Findings

01 Achieves an F1-score of 97.85 on the i2b2 2014 dataset

02 Achieves an F1-score of 99.23 on the MIMIC dataset

03 Outperforms previous state-of-the-art systems
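Because the findings are reported as F1-scores, a short reminder of how that metric is computed may help. The sketch below uses the standard definition (the harmonic mean of precision and recall). The function name and the counts are hypothetical and are not taken from the paper, and the paper's own evaluation granularity (for example token-level versus entity-level matching) is not specified on this page. The reported scores appear to be on a 0 to 100 scale, while the sketch returns values between 0 and 1.

```python
def precision_recall_f1(true_positives, false_positives, false_negatives):
    """Return (precision, recall, F1) from raw counts of PHI predictions.

    Hypothetical helper for illustration only; not the paper's evaluation script.
    """
    predicted = true_positives + false_positives   # everything the system flagged as PHI
    actual = true_positives + false_negatives      # everything annotated as PHI
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / actual if actual else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Made-up counts, chosen only to make the arithmetic easy to follow.
precision, recall, f1 = precision_recall_f1(
    true_positives=90, false_positives=10, false_negatives=30
)
print(f"precision={precision:.4f} recall={recall:.4f} f1={f1:.4f}")
```

For de-identification, recall is usually the more sensitive of the two components, since every missed PHI item is a potential privacy leak, so it is worth looking at precision and recall separately rather than at F1 alone.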

Abstract

Objective: Patient notes in electronic health records (EHRs) may contain critical information for medical investigations. However, the vast majority of medical investigators can only access de-identified notes, in order to protect the confidentiality of patients. In the United States, the Health Insurance Portability and Accountability Act (HIPAA) defines 18 types of protected health information (PHI) that need to be removed to de-identify patient notes. Manual de-identification is impractical given the size of EHR databases, the limited number of researchers with access to the non-de-identified notes, and the frequent mistakes of human annotators. A reliable automated de-identification system would consequently be of high value.

Materials and Methods: We introduce the first de-identification system based on artificial neural networks (ANNs), which requires no handcrafted features or…

Code & Models

Repositories

Franck-Dernoncourt/NeuroNER (TensorFlow)
