Leveraging LLMs For Turkish Skill Extraction
Ezgi Arslan \.Ilt\"uzer, \"Ozg\"ur An{\i}l \"Ozl\"u, Vahid Farajijobehdar, G\"ul\c{s}en Eryi\u{g}it

TL;DR
This paper introduces the first Turkish skill extraction dataset and evaluates LLMs, demonstrating their effectiveness in extracting skills from Turkish job postings, a low-resource language, outperforming traditional methods.
Contribution
It presents the first Turkish skill extraction dataset and assesses LLM performance, highlighting effective prompting strategies and model configurations for low-resource language skill extraction.
Findings
LLMs outperform supervised sequence labeling in skill extraction.
Dynamic few-shot prompting with Claude Sonnet 3.7 yields best results.
Turkish skill extraction performance aligns with other languages, advancing low-resource NLP.
Abstract
Skill extraction is a critical component of modern recruitment systems, enabling efficient job matching, personalized recommendations, and labor market analysis. Despite T\"urkiye's significant role in the global workforce, Turkish, a morphologically complex language, lacks both a skill taxonomy and a dedicated skill extraction dataset, resulting in underexplored research in skill extraction for Turkish. This article seeks the answers to three research questions: 1) How can skill extraction be effectively performed for this language, in light of its low resource nature? 2)~What is the most promising model? 3) What is the impact of different Large Language Models (LLMs) and prompting strategies on skill extraction (i.e., dynamic vs. static few-shot samples, varying context information, and encouraging causal reasoning)? The article introduces the first Turkish skill extraction dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Expert finding and Q&A systems · Text Readability and Simplification
