A practical method for occupational skills detection in Vietnamese job   listings

Viet-Trung Tran; Hai-Nam Cao; Tuan-Dung Cao

arXiv:2210.14607·cs.CL·October 27, 2022

A practical method for occupational skills detection in Vietnamese job listings

Viet-Trung Tran, Hai-Nam Cao, Tuan-Dung Cao

PDF

Open Access

TL;DR

This paper presents a practical, ranking-based method for detecting occupational skills in Vietnamese job listings, addressing data scarcity issues and outperforming traditional NER approaches.

Contribution

It introduces a novel ranking-based pipeline for skill detection that reduces reliance on large annotated datasets and improves accuracy in scarce data scenarios.

Findings

01

The proposed method outperforms NER models on limited datasets.

02

The pipeline effectively extracts and ranks skill phrases in context.

03

Experiments on three datasets validate the approach's robustness.

Abstract

Vietnamese labor market has been under an imbalanced development. The number of university graduates is growing, but so is the unemployment rate. This situation is often caused by the lack of accurate and timely labor market information, which leads to skill miss-matches between worker supply and the actual market demands. To build a data monitoring and analytic platform for the labor market, one of the main challenges is to be able to automatically detect occupational skills from labor-related data, such as resumes and job listings. Traditional approaches rely on existing taxonomy and/or large annotated data to build Named Entity Recognition (NER) models. They are expensive and require huge manual efforts. In this paper, we propose a practical methodology for skill detection in Vietnamese job listings. Rather than viewing the task as a NER task, we consider the task as a ranking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies