Exploring deep learning methods for recognizing rare diseases and their clinical manifestations from texts
Isabel Segura-Bedmar, David Camino-Perdonas, Sara Guerrero-Aspizua

TL;DR
This paper investigates deep learning models, especially BioBERT, for recognizing rare diseases and their clinical signs from texts, aiming to improve diagnosis accuracy using NLP techniques on a specialized corpus.
Contribution
It demonstrates the effectiveness of deep learning, particularly BioBERT, in extracting rare disease information from biomedical texts, outperforming other models.
Findings
BioBERT achieved an F1-score of 85.2% for rare disease recognition.
Deep learning models significantly improve extraction accuracy over traditional methods.
The study highlights the potential of NLP in aiding rare disease diagnosis.
Abstract
Although rare diseases are characterized by low prevalence, approximately 300 million people are affected by a rare disease. The early and accurate diagnosis of these conditions is a major challenge for general practitioners, who do not have enough knowledge to identify them. In addition to this, rare diseases usually show a wide variety of manifestations, which might make the diagnosis even more difficult. A delayed diagnosis can negatively affect the patient's life. Therefore, there is an urgent need to increase the scientific and medical knowledge about rare diseases. Natural Language Processing (NLP) and Deep Learning can help to extract relevant information about rare diseases to facilitate their diagnosis and treatments. The paper explores the use of several deep learning techniques such as Bidirectional Long Short Term Memory (BiLSTM) networks or deep contextualized word…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Genomics and Rare Diseases · Translation Studies and Practices
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Layer Normalization · Dropout · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Multi-Head Attention · Linear Warmup With Linear Decay
