Leveraging LLMs For Turkish Skill Extraction

Ezgi Arslan \.Ilt\"uzer; \"Ozg\"ur An{\i}l \"Ozl\"u; Vahid Farajijobehdar; G\"ul\c{s}en Eryi\u{g}it

arXiv:2601.22885·cs.CL·February 2, 2026

Leveraging LLMs For Turkish Skill Extraction

Ezgi Arslan \.Ilt\"uzer, \"Ozg\"ur An{\i}l \"Ozl\"u, Vahid Farajijobehdar, G\"ul\c{s}en Eryi\u{g}it

PDF

Open Access

TL;DR

This paper introduces the first Turkish skill extraction dataset and evaluates LLMs, demonstrating their effectiveness in extracting skills from Turkish job postings, a low-resource language, outperforming traditional methods.

Contribution

It presents the first Turkish skill extraction dataset and assesses LLM performance, highlighting effective prompting strategies and model configurations for low-resource language skill extraction.

Findings

01

LLMs outperform supervised sequence labeling in skill extraction.

02

Dynamic few-shot prompting with Claude Sonnet 3.7 yields best results.

03

Turkish skill extraction performance aligns with other languages, advancing low-resource NLP.

Abstract

Skill extraction is a critical component of modern recruitment systems, enabling efficient job matching, personalized recommendations, and labor market analysis. Despite T\"urkiye's significant role in the global workforce, Turkish, a morphologically complex language, lacks both a skill taxonomy and a dedicated skill extraction dataset, resulting in underexplored research in skill extraction for Turkish. This article seeks the answers to three research questions: 1) How can skill extraction be effectively performed for this language, in light of its low resource nature? 2)~What is the most promising model? 3) What is the impact of different Large Language Models (LLMs) and prompting strategies on skill extraction (i.e., dynamic vs. static few-shot samples, varying context information, and encouraging causal reasoning)? The article introduces the first Turkish skill extraction dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Text Readability and Simplification