Priberam at MESINESP Multi-label Classification of Medical Texts Task
Ruben Cardoso, Zita Marinho, Afonso Mendes, Sebasti\~ao Miranda

TL;DR
This paper describes Priberam's approach to the MESINESP multi-label classification challenge, employing multiple models including SVM, BERT, and a search engine, with ensemble methods achieving top performance in medical article classification.
Contribution
The work introduces a multi-model ensemble approach to large-scale multi-label medical text classification, improving performance in a challenging hierarchical label space.
Findings
Ensemble model achieved the best classification performance.
All individual models performed well independently.
Priberam ranked 6th overall and 2nd among teams in the challenge.
Abstract
Medical articles provide current state of the art treatments and diagnostics to many medical practitioners and professionals. Existing public databases such as MEDLINE contain over 27 million articles, making it difficult to extract relevant content without the use of efficient search engines. Information retrieval tools are crucial in order to navigate and provide meaningful recommendations for articles and treatments. Classifying these articles into broader medical topics can improve the retrieval of related articles. The set of medical labels considered for the MESINESP task is on the order of several thousands of labels (DeCS codes), which falls under the extreme multi-label classification problem. The heterogeneous and highly hierarchical structure of medical topics makes the task of manually classifying articles extremely laborious and costly. It is, therefore, crucial to automate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Topic Modeling · Advanced Text Analysis Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Layer Normalization · Linear Warmup With Linear Decay · Softmax · Multi-Head Attention · Residual Connection · WordPiece · Weight Decay
