Advanced Natural-based interaction for the ITAlian language:   LLaMAntino-3-ANITA

Marco Polignano; Pierpaolo Basile; Giovanni Semeraro

arXiv:2405.07101·cs.CL·May 14, 2024·5 cites

Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA

Marco Polignano, Pierpaolo Basile, Giovanni Semeraro

PDF

Open Access 10 Models

TL;DR

This paper introduces LLaMAntino-3-ANITA, a state-of-the-art Italian language model based on Meta's LLaMA-3, fine-tuned with advanced techniques to improve performance, safety, and efficiency for various NLP tasks.

Contribution

The paper presents a novel fine-tuning and optimization pipeline for Italian language modeling, combining SFT, QLoRA, and DPO techniques to enhance performance and safety.

Findings

01

Achieved significant performance improvements on Italian benchmarks.

02

Demonstrated efficient fine-tuning with reduced computational resources.

03

Produced a safe, aligned model suitable for diverse NLP tasks.

Abstract

In the pursuit of advancing natural language processing for the Italian language, we introduce a state-of-the-art Large Language Model (LLM) based on the novel Meta LLaMA-3 model: LLaMAntino-3-ANITA-8B-Inst-DPO-ITA. We fine-tuned the original 8B parameters instruction tuned model using the Supervised Fine-tuning (SFT) technique on the English and Italian language datasets in order to improve the original performance. Consequently, a Dynamic Preference Optimization (DPO) process has been used to align preferences, avoid dangerous and inappropriate answers, and limit biases and prejudices. Our model leverages the efficiency of QLoRA to fine-tune the model on a smaller portion of the original model weights and then adapt the model specifically for the Italian linguistic structure, achieving significant improvements in both performance and computational efficiency. Concurrently, DPO is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLinguistic Studies and Language Acquisition · Natural Language Processing Techniques · Speech and dialogue systems

MethodsDirect Preference Optimization · ALIGN · Shrink and Fine-Tune