Multi-objective Representation for Numbers in Clinical Narratives: A CamemBERT-Bio-Based Alternative to Large-Scale LLMs
Boammani Aser Lompo, Thanh-Dung Le

TL;DR
This paper explores methods to improve numerical value understanding in clinical texts using CamemBERT-bio, introducing label embedding and multiple number representations to enhance classification accuracy in healthcare NLP tasks.
Contribution
It presents two novel techniques—label embedding for self-attention and multiple number representations—to improve transformer models' handling of numerical data in medical narratives.
Findings
Fine-tuning alone did not improve performance.
LESA significantly increased F1 scores by over 13%.
Combining LESA with Xval outperformed traditional methods and matched GPT-4 results.
Abstract
The processing of numerical values is a rapidly developing area in the field of Language Models (LLMs). Despite numerous advancements achieved by previous research, significant challenges persist, particularly within the healthcare domain. This paper investigates the limitations of Transformer models in understanding numerical values. \textit{Objective:} this research aims to categorize numerical values extracted from medical documents into eight specific physiological categories using CamemBERT-bio. \textit{Methods:} In a context where scalable methods and Large Language Models (LLMs) are emphasized, we explore lifting the limitations of transformer-based models. We examine two strategies: fine-tuning CamemBERT-bio on a small medical dataset, integrating Label Embedding for Self-Attention (LESA), and combining LESA with additional enhancement techniques such as Xval. Given that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematics, Computing, and Information Processing · Educational Assessment and Pedagogy · Machine Learning in Healthcare
MethodsAttention Is All You Need · Byte Pair Encoding · Dense Connections · Residual Connection · Linear Layer · Absolute Position Encodings · Layer Normalization · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer
