Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA
Marco Polignano, Pierpaolo Basile, Giovanni Semeraro

TL;DR
This paper introduces LLaMAntino-3-ANITA, a state-of-the-art Italian language model based on Meta's LLaMA-3, fine-tuned with advanced techniques to improve performance, safety, and efficiency for various NLP tasks.
Contribution
The paper presents a novel fine-tuning and optimization pipeline for Italian language modeling, combining SFT, QLoRA, and DPO techniques to enhance performance and safety.
Findings
Achieved significant performance improvements on Italian benchmarks.
Demonstrated efficient fine-tuning with reduced computational resources.
Produced a safe, aligned model suitable for diverse NLP tasks.
Abstract
In the pursuit of advancing natural language processing for the Italian language, we introduce a state-of-the-art Large Language Model (LLM) based on the novel Meta LLaMA-3 model: LLaMAntino-3-ANITA-8B-Inst-DPO-ITA. We fine-tuned the original 8B parameters instruction tuned model using the Supervised Fine-tuning (SFT) technique on the English and Italian language datasets in order to improve the original performance. Consequently, a Dynamic Preference Optimization (DPO) process has been used to align preferences, avoid dangerous and inappropriate answers, and limit biases and prejudices. Our model leverages the efficiency of QLoRA to fine-tune the model on a smaller portion of the original model weights and then adapt the model specifically for the Italian linguistic structure, achieving significant improvements in both performance and computational efficiency. Concurrently, DPO is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITAmodel· 12k dl· ♡ 3012k dl♡ 30
- 🤗swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA_GGUFmodel· 48 dl· ♡ 448 dl♡ 4
- 🤗swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA_EXL2model
- 🤗RichardErkhov/swap-uniba_-_LLaMAntino-3-ANITA-8B-Inst-DPO-ITA-ggufmodel· 38 dl38 dl
- 🤗fakezeta/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA-ov-int4model· 1 dl1 dl
- 🤗fakezeta/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA-ov-int8model· 2 dl2 dl
- 🤗m-polignano/ANITA-NEXT-24B-Dolphin-Mistral-UNCENSORED-ITAmodel· 65 dl· ♡ 865 dl♡ 8
- 🤗m-polignano/ANITA-NEXT-24B-Magistral-2506-ITAmodel· 13 dl· ♡ 313 dl♡ 3
- 🤗m-polignano/ANITA-NEXT-24B-Magistral-2506-VISION-ITAmodel· 11 dl· ♡ 711 dl♡ 7
- 🤗m-polignano/ANITA-NEXT-24B-Magistral-2506-ITA-GGUFmodel· 74 dl74 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistic Studies and Language Acquisition · Natural Language Processing Techniques · Speech and dialogue systems
MethodsDirect Preference Optimization · ALIGN · Shrink and Fine-Tune
