Adapting LLMs for the Medical Domain in Portuguese: A Study on Fine-Tuning and Model Evaluation
Pedro Henrique Paiola, Gabriel Lino Garcia, Jo\~ao Renato Ribeiro, Manesco, Mateus Roder, Douglas Rodrigues, Jo\~ao Paulo Papa

TL;DR
This paper evaluates Portuguese medical large language models, fine-tuning them with datasets translated from English, and compares their performance, highlighting challenges like catastrophic forgetting and the need for better evaluation protocols.
Contribution
It demonstrates the effectiveness of the InternLM2 model for medical tasks in Portuguese and analyzes the impact of fine-tuning methods and dataset translation on model performance.
Findings
InternLM2 achieved the best overall performance.
DrBode models showed catastrophic forgetting of medical knowledge.
Low inter-rater agreement indicates need for better evaluation methods.
Abstract
This study evaluates the performance of large language models (LLMs) as medical agents in Portuguese, aiming to develop a reliable and relevant virtual assistant for healthcare professionals. The HealthCareMagic-100k-en and MedQuAD datasets, translated from English using GPT-3.5, were used to fine-tune the ChatBode-7B model using the PEFT-QLoRA method. The InternLM2 model, with initial training on medical data, presented the best overall performance, with high precision and adequacy in metrics such as accuracy, completeness and safety. However, DrBode models, derived from ChatBode, exhibited a phenomenon of catastrophic forgetting of acquired medical knowledge. Despite this, these models performed frequently or even better in aspects such as grammaticality and coherence. A significant challenge was low inter-rater agreement, highlighting the need for more robust assessment protocols.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicslinguistics and terminology studies · Natural Language Processing Techniques · Semantic Web and Ontologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Cosine Annealing · Layer Normalization · Linear Warmup With Cosine Annealing · Adam · Linear Layer · Residual Connection · Weight Decay
