An\'alise de ambiguidade lingu\'istica em modelos de linguagem de grande escala (LLMs)
Lav\'inia de Carvalho Moraes, Irene Cristina Silv\'erio, Rafael, Alexandre Sousa Marques, Bianca de Castro Anaia, Dandara Freitas de Paula,, Maria Carolina Schincariol de Faria, Iury Cleveston, Alana de Santana, Correia, Raquel Meister Ko Freitag

TL;DR
This study examines how large language models like ChatGPT and Gemini handle linguistic ambiguity in Brazilian Portuguese, revealing their limitations and the need for improved supervised learning methods.
Contribution
It provides a detailed analysis of semantic, syntactic, and lexical ambiguity in LLMs, including a new corpus and evaluation of model responses in Portuguese.
Findings
Models show errors and inconsistencies in ambiguity explanations.
Maximum accuracy achieved was 49.58%.
Highlights the need for further descriptive and supervised learning studies.
Abstract
Linguistic ambiguity continues to represent a significant challenge for natural language processing (NLP) systems, notwithstanding the advancements in architectures such as Transformers and BERT. Inspired by the recent success of instructional models like ChatGPT and Gemini (In 2023, the artificial intelligence was called Bard.), this study aims to analyze and discuss linguistic ambiguity within these models, focusing on three types prevalent in Brazilian Portuguese: semantic, syntactic, and lexical ambiguity. We create a corpus comprising 120 sentences, both ambiguous and unambiguous, for classification, explanation, and disambiguation. The models capability to generate ambiguous sentences was also explored by soliciting sets of sentences for each type of ambiguity. The results underwent qualitative analysis, drawing on recognized linguistic references, and quantitative assessment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Linear Layer · Adam · Linear Warmup With Linear Decay · Layer Normalization · Multi-Head Attention · Dropout · Attention Dropout
