Introducing TrGLUE and SentiTurca: A Comprehensive Benchmark for Turkish General Language Understanding and Sentiment Analysis
Duygu Altinok

TL;DR
This paper introduces TrGLUE and SentiTurca, comprehensive benchmarks for Turkish language understanding and sentiment analysis, including datasets, evaluation protocols, and tools to advance NLP research in Turkish.
Contribution
It presents the first Turkish NLU benchmark (TrGLUE) and a sentiment analysis benchmark (SentiTurca), along with fine-tuning and evaluation code for transformer models.
Findings
TrGLUE covers diverse NLU tasks for Turkish.
SentiTurca provides a specialized sentiment analysis dataset.
The benchmarks enable effective evaluation of Turkish NLP models.
Abstract
Evaluating the performance of various model architectures, such as transformers, large language models (LLMs), and other NLP systems, requires comprehensive benchmarks that measure performance across multiple dimensions. Among these, the evaluation of natural language understanding (NLU) is particularly critical as it serves as a fundamental criterion for assessing model capabilities. Thus, it is essential to establish benchmarks that enable thorough evaluation and analysis of NLU abilities from diverse perspectives. While the GLUE benchmark has set a standard for evaluating English NLU, similar benchmarks have been developed for other languages, such as CLUE for Chinese, FLUE for French, and JGLUE for Japanese. However, no comparable benchmark currently exists for the Turkish language. To address this gap, we introduce TrGLUE, a comprehensive benchmark encompassing a variety of NLU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Natural Language Processing Techniques
