Searching for Best Practices in Medical Transcription with Large Language Model
Jiafeng Li, Yanda Mu

TL;DR
This paper presents a novel application of Large Language Models to improve the accuracy of medical transcription from audio recordings with Indian accents, focusing on recognizing specialized terminology.
Contribution
It introduces a new LLM-based method specifically designed for medical transcription with accented speech, improving accuracy and recognition of medical terms.
Findings
Significant reduction in Word Error Rate (WER).
Enhanced recognition of medical terminology.
Improved overall transcription accuracy.
Abstract
The transcription of medical monologues, especially those containing a high density of specialized terminology and delivered with a distinct accent, presents a significant challenge for existing automated systems. This paper introduces a novel approach leveraging a Large Language Model (LLM) to generate highly accurate medical transcripts from audio recordings of doctors' monologues, specifically focusing on Indian accents. Our methodology integrates advanced language modeling techniques to lower the Word Error Rate (WER) and ensure the precise recognition of critical medical terms. Through rigorous testing on a comprehensive dataset of medical recordings, our approach demonstrates substantial improvements in both overall transcription accuracy and the fidelity of key medical terminologies. These results suggest that our proposed system could significantly aid in clinical documentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling
