BioBridge: Unified Bio-Embedding with Bridging Modality in Code-Switched EMR
Jangyeong Jeon, Sangyeon Cho, Dongjoon Lee, Changhee Lee, Junyeong Kim

TL;DR
BioBridge is a novel NLP framework designed to improve analysis of code-switched electronic medical records in pediatric emergency settings, enhancing decision-making accuracy by bridging language and domain gaps.
Contribution
It introduces a unified bio-embedding approach with a bridging modality to better understand bilingual and code-switched EMRs, improving over existing models.
Findings
BioBridge outperforms traditional machine learning models.
BioBridge-XLM shows significant improvements in F1, AUROC, and AUPRC.
Brier score decreases indicate better prediction calibration.
Abstract
Pediatric Emergency Department (PED) overcrowding presents a significant global challenge, prompting the need for efficient solutions. This paper introduces the BioBridge framework, a novel approach that applies Natural Language Processing (NLP) to Electronic Medical Records (EMRs) in written free-text form to enhance decision-making in PED. In non-English speaking countries, such as South Korea, EMR data is often written in a Code-Switching (CS) format that mixes the native language with English, with most code-switched English words having clinical significance. The BioBridge framework consists of two core modules: "bridging modality in context" and "unified bio-embedding." The "bridging modality in context" module improves the contextual understanding of bilingual and code-switched EMRs. In the "unified bio-embedding" module, the knowledge of the model trained in the medical domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRFID technology advancements
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Dense Connections · Byte Pair Encoding · Multi-Head Attention · Residual Connection · Attention Dropout · Softmax · Adam · Layer Normalization
