Audio De-identification: A New Entity Recognition Task

Ido Cohn; Itay Laish; Genady Beryozkin; Gang Li; Izhak Shafran; Idan; Szpektor; Tzvika Hartman; Avinatan Hassidim; Yossi Matias

arXiv:1903.07037·cs.CL·May 7, 2019·5 cites

Audio De-identification: A New Entity Recognition Task

Ido Cohn, Itay Laish, Genady Beryozkin, Gang Li, Izhak Shafran, Idan, Szpektor, Tzvika Hartman, Avinatan Hassidim, Yossi Matias

PDF

Open Access

TL;DR

This paper introduces the task of audio de-identification, combining speech recognition and entity recognition to detect and redact personal information in medical conversation recordings, and provides a new benchmark for evaluation.

Contribution

It defines the novel task of audio de-ID, proposes a pipeline integrating ASR and NER with alignment, and introduces a new evaluation metric and benchmark dataset.

Findings

01

Pipeline achieves promising detection accuracy.

02

New metric effectively evaluates audio de-ID performance.

03

Benchmark dataset enables standardized evaluation.

Abstract

Named Entity Recognition (NER) has been mostly studied in the context of written text. Specifically, NER is an important step in de-identification (de-ID) of medical records, many of which are recorded conversations between a patient and a doctor. In such recordings, audio spans with personal information should be redacted, similar to the redaction of sensitive character spans in de-ID for written text. The application of NER in the context of audio de-identification has yet to be fully investigated. To this end, we define the task of audio de-ID, in which audio spans with entity mentions should be detected. We then present our pipeline for this task, which involves Automatic Speech Recognition (ASR), NER on the transcript text, and text-to-audio alignment. Finally, we introduce a novel metric for audio de-ID and a new evaluation benchmark consisting of a large labeled segment of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis