EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing
Iker de la Iglesia, Aitziber Atutxa, Koldo Gojenola, Ander, Barrena

TL;DR
EriBERTa is a bilingual pre-trained language model designed for clinical NLP in Spanish, outperforming previous models and demonstrating strong transfer learning capabilities across languages.
Contribution
Introduces EriBERTa, a novel bilingual clinical language model trained on extensive medical data, enhancing Spanish clinical NLP applications.
Findings
EriBERTa outperforms previous Spanish clinical language models.
EriBERTa shows strong transfer learning abilities between languages.
EriBERTa effectively understands and extracts information from medical texts.
Abstract
The utilization of clinical reports for various secondary purposes, including health research and treatment monitoring, is crucial for enhancing patient care. Natural Language Processing (NLP) tools have emerged as valuable assets for extracting and processing relevant information from these reports. However, the availability of specialized language models for the clinical domain in Spanish has been limited. In this paper, we introduce EriBERTa, a bilingual domain-specific language model pre-trained on extensive medical and clinical corpora. We demonstrate that EriBERTa outperforms previous Spanish language models in the clinical domain, showcasing its superior capabilities in understanding medical texts and extracting meaningful information. Moreover, EriBERTa exhibits promising transfer learning abilities, allowing for knowledge transfer from one language to another. This aspect is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗HiTZ/EriBERTa-basemodel· 374 dl· ♡ 3374 dl♡ 3
- 🤗medspaner/EriBERTa-clinical-trials-7sgs-umlsmodel· 2 dl2 dl
- 🤗medspaner/EriBERTa-clinical-trials-temp-entsmodel· 2 dl2 dl
- 🤗medspaner/EriBERTa-clinical-trials-medic-attrmodel· 4 dl4 dl
- 🤗medspaner/EriBERTa-clinical-trials-neg-specmodel
- 🤗medspaner/EriBERTa-clinical-trials-misc-entsmodel· 1 dl1 dl
- 🤗medspaner/EriBERTa-clinical-trials-attributesmodel· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
