Clinical Dialogue Transcription Error Correction using Seq2Seq Models
Gayani Nanayakkara, Nirmalie Wiratunga, David Corsar, Kyle Martin,, Anjana Wijekoon

TL;DR
This paper introduces a seq2seq model approach for correcting transcription errors in clinical dialogues, utilizing a new dataset and domain-specific fine-tuning to improve accuracy of speech-to-text systems in healthcare settings.
Contribution
The study presents a novel seq2seq error correction method, a new clinical dialogue dataset, and demonstrates improved transcription accuracy through domain-specific fine-tuning.
Findings
BART-based model reduces word error rates in clinical ASR outputs
Introduces a new gastrointestinal clinical dialogue dataset
Shows effectiveness of domain-specific fine-tuning for error correction
Abstract
Good communication is critical to good healthcare. Clinical dialogue is a conversation between health practitioners and their patients, with the explicit goal of obtaining and sharing medical information. This information contributes to medical decision-making regarding the patient and plays a crucial role in their healthcare journey. The reliance on note taking and manual scribing processes are extremely inefficient and leads to manual transcription errors when digitizing notes. Automatic Speech Recognition (ASR) plays a significant role in speech-to-text applications, and can be directly used as a text generator in conversational applications. However, recording clinical dialogue presents a number of general and domain-specific challenges. In this paper, we present a seq2seq learning approach for ASR transcription error correction of clinical dialogues. We introduce a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterpreting and Communication in Healthcare · Speech and dialogue systems · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Sigmoid Activation · Adam · Tanh Activation · Long Short-Term Memory · Residual Connection · Layer Normalization
