Automatic Lexical Simplification for Turkish

Ahmet Yavuz Uluslu

arXiv:2201.05878·cs.CL·September 22, 2025·1 cites

Automatic Lexical Simplification for Turkish

Ahmet Yavuz Uluslu

PDF

Open Access

TL;DR

This paper introduces the first automatic lexical simplification system for Turkish, leveraging pretrained BERT models and morphological features to handle the language's complexity.

Contribution

It presents a novel text simplification pipeline specifically designed for Turkish, addressing its morphological richness and resource limitations.

Findings

01

Effective simplification of Turkish words using the proposed method

02

Improved grammatical correctness and semantic appropriateness

03

Addresses challenges of low-resource, morphologically rich languages

Abstract

In this paper, we present the first automatic lexical simplification system for the Turkish language. Recent text simplification efforts rely on manually crafted simplified corpora and comprehensive NLP tools that can analyse the target text both in word and sentence levels. Turkish is a morphologically rich agglutinative language that requires unique considerations such as the proper handling of inflectional cases. Being a low-resource language in terms of available resources and industrial-strength tools, it makes the text simplification task harder to approach. We present a new text simplification pipeline based on pretrained representation model BERT together with morphological features to generate grammatically correct and semantically appropriate word-level simplifications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Linear Warmup With Linear Decay · Softmax · Attention Dropout · Layer Normalization · Residual Connection · WordPiece · Adam