NoteContrast: Contrastive Language-Diagnostic Pretraining for Medical   Text

Prajwal Kailas; Max Homilius; Rahul C. Deo; Calum A. MacRae

arXiv:2412.11477·cs.LG·December 17, 2024

NoteContrast: Contrastive Language-Diagnostic Pretraining for Medical Text

Prajwal Kailas, Max Homilius, Rahul C. Deo, Calum A. MacRae

PDF

Open Access 1 Repo

TL;DR

NoteContrast introduces a contrastive pretraining method that jointly models medical notes and diagnostic codes, significantly improving automated diagnostic coding accuracy in healthcare data analysis.

Contribution

The paper presents a novel contrastive pretraining approach combining language models and diagnostic codes, enhancing medical note interpretation and coding accuracy.

Findings

01

Outperforms previous models on MIMIC-III datasets

02

Improves sensitivity and specificity of diagnostic coding

03

Effective integration of text and code representations

Abstract

Accurate diagnostic coding of medical notes is crucial for enhancing patient care, medical research, and error-free billing in healthcare organizations. Manual coding is a time-consuming task for providers, and diagnostic codes often exhibit low sensitivity and specificity, whereas the free text in medical notes can be a more precise description of a patients status. Thus, accurate automated diagnostic coding of medical notes has become critical for a learning healthcare system. Recent developments in long-document transformer architectures have enabled attention-based deep-learning models to adjudicate medical notes. In addition, contrastive loss functions have been used to jointly pre-train large language and image models with noisy labels. To further improve the automated adjudication of medical notes, we developed an approach based on i) models for ICD-10 diagnostic code sequences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

obi-ml-public/notecontrast
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification