CLiGNet: Clinical Label-Interaction Graph Network for Medical Specialty Classification from Clinical Transcriptions
Pronob Kumar Barman, Pronoy Kumar Barman

TL;DR
This paper introduces CLiGNet, a novel neural architecture combining clinical text encoding and label interaction graphs, to improve medical specialty classification from clinical transcriptions while addressing data leakage issues.
Contribution
The paper establishes a leakage-free benchmark for medical specialty classification and proposes CLiGNet, which leverages a label graph and clinical BERT encoding for improved performance.
Findings
CLiGNet achieves the highest macro F1 score among tested models.
The label graph significantly improves classification performance.
Calibration with Platt scaling yields reliable probability estimates.
Abstract
Automated classification of clinical transcriptions into medical specialties is essential for routing, coding, and clinical decision support, yet prior work on the widely used MTSamples benchmark suffers from severe data leakage caused by applying SMOTE oversampling before train test splitting. We first document this methodological flaw and establish a leakage free benchmark across 40 medical specialties (4966 records), revealing that the true task difficulty is substantially higher than previously reported. We then introduce CLiGNet (Clinical Label Interaction Graph Network), a neural architecture that combines a Bio ClinicalBERT text encoder with a two layer Graph Convolutional Network operating on a specialty label graph constructed from semantic similarity and ICD 10 chapter priors. Per label attention gates fuse document and label graph representations, trained with focal binary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Advanced Graph Neural Networks · Topic Modeling
