Sabi\'a-3 Technical Report

Hugo Abonizio; Thales Sales Almeida; Thiago Laitz; Roseval Malaquias; Junior; Giovana Kerche Bon\'as; Rodrigo Nogueira; Ramon Pires

arXiv:2410.12049·cs.CL·April 2, 2025·3 cites

Sabi\'a-3 Technical Report

Hugo Abonizio, Thales Sales Almeida, Thiago Laitz, Roseval Malaquias, Junior, Giovana Kerche Bon\'as, Rodrigo Nogueira, Ramon Pires

PDF

Open Access

TL;DR

This paper introduces Sabiá-3, a large Brazilian Portuguese language model that achieves high performance on various benchmarks at a significantly lower cost, emphasizing the advantages of domain-specific training.

Contribution

The paper presents Sabiá-3 and Sabiazinho-3, new language models trained on a Brazilian corpus, with Sabiá-3 outperforming previous models and matching frontier LLMs at reduced costs.

Findings

01

Sabiá-3 performs strongly on Portuguese and Brazil-related tasks.

02

Sabiá-3 matches frontier LLM performance levels.

03

Cost per token is three to four times lower than comparable models.

Abstract

This report presents Sabi\'a-3, our new flagship language model, and Sabiazinho-3, a more cost-effective sibling. The models were trained on a large brazilian-centric corpus. Evaluations across diverse professional and academic benchmarks show a strong performance on Portuguese and Brazil-related tasks. Sabi\'a-3 shows large improvements in comparison to our previous best of model, Sabia-2 Medium, especially in reasoning-intensive tasks. Notably, Sabi\'a-3's average performance matches frontier LLMs, while it is offered at a three to four times lower cost per token, reinforcing the benefits of domain specialization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFluid Dynamics Simulations and Interactions · Historical Astronomy and Related Studies