Examining the Behavior of LLM Architectures Within the Framework of Standardized National Exams in Brazil
Marcelo Sartori Locatelli, Matheus Prado Miranda, Igor Joaquim da Silva Costa, Matheus Torres Prates, Victor Thomé, Mateus Zaparoli Monteiro, Tomas Lacerda, Adriana Pagano, Eduardo Rios Neto, Wagner Meira Jr., Virgilio Almeida

TL;DR
This study compares the performance and linguistic features of GPT-3.5, GPT-4, and MariTalk with Brazilian students on the ENEM exam to assess biases and differences in answer distributions and essay styles.
Contribution
It provides a novel analysis of LLMs in the context of Brazilian standardized exams, highlighting differences in answer patterns and linguistic characteristics compared to human students.
Findings
On the multiple-choice tests, LLM answers show no significant bias relative to human answers.
Model essays differ from human essays in word choice and sentence structure.
LLMs' outputs do not closely resemble any specific human socioeconomic group.
Abstract
The Exame Nacional do Ensino Médio (ENEM) is a pivotal test for Brazilian students, required for admission to a significant number of universities in Brazil. The test consists of four objective high-school level tests on Math, Humanities, Natural Sciences and Languages, and one writing essay. Students' answers to the test and to the accompanying socioeconomic status questionnaire are made public every year (albeit anonymized) due to transparency policies from the Brazilian Government. In the context of large language models (LLMs), these data lend themselves nicely to comparing different groups of humans with AI, as we can have access to human and machine answer distributions. We leverage these characteristics of the ENEM dataset and compare GPT-3.5 and 4, and MariTalk, a model trained using Portuguese data, to humans, aiming to ascertain how their answers relate to real societal…
Taxonomy
Topics: Stonefly species taxonomy and ecology · Brazilian Legal Issues · Border Security and International Relations
Methods: Attention Is All You Need · Linear Layer · Attention Dropout · Residual Connection · Multi-Head Attention · Cosine Annealing · Weight Decay
