Fauno: The Italian Large Language Model that will leave you senza parole!
Andrea Bacciu, Giovanni Trappolini, Andrea Santilli, Emanuele, Rodol\`a, Fabrizio Silvestri

TL;DR
Fauno is the first large open-source Italian conversational LLM, enabling accessible development of Italian conversational AI with a single GPU, supported by diverse datasets and open code.
Contribution
It introduces Fauno, the largest open-source Italian conversational LLM, and provides datasets and code to democratize Italian LLM research.
Findings
Fauno demonstrates effective conversational capabilities in Italian.
The model is fine-tuned on diverse Italian datasets.
Open-source release facilitates further research and development.
Abstract
This paper presents Fauno, the first and largest open-source Italian conversational Large Language Model (LLM). Our goal with Fauno is to democratize the study of LLMs in Italian, demonstrating that obtaining a fine-tuned conversational bot with a single GPU is possible. In addition, we release a collection of datasets for conversational AI in Italian. The datasets on which we fine-tuned Fauno include various topics such as general question answering, computer science, and medical questions. We release our code and datasets on \url{https://github.com/RSTLess-research/Fauno-Italian-LLM}
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
