Skill Extraction from Job Postings using Weak Supervision

Mike Zhang; Kristian Nørgaard Jensen; Rob van der Goot; Barbara Plank

arXiv:2209.08071 · cs.CL · September 19, 2022 · 6 cites

Open Access · 1 Repo

TL;DR

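This paper introduces a weakly supervised method for extracting skills from job postings. It matches text in job ads against skills from the European Skills, Competences, Qualifications and Occupations (ESCO) taxonomy via latent representations, which reduces the need for costly annotation and outperforms baselines based on token-level and syntactic patterns.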

Contribution

It presents a weak supervision technique that uses an existing skills taxonomy as the label source, so skills can be extracted from job ads without extensive labeled data.

Findings

01 Outperforms baselines based on token-level and syntactic patterns

02 Leverages the European Skills, Competences, Qualifications and Occupations (ESCO) taxonomy to find similar skills in job ads

03 Shows a strong positive signal for weakly supervised skill extraction

Abstract

Aggregated data obtained from job postings provide powerful insights into labor market demands and emerging skills, and aid job matching. However, most extraction approaches are supervised and thus need costly and time-consuming annotation. To overcome this, we propose Skill Extraction with Weak Supervision. We leverage the European Skills, Competences, Qualifications and Occupations taxonomy to find similar skills in job ads via latent representations. The method shows a strong positive signal, outperforming baselines based on token-level and syntactic patterns.
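The matching idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `embed` function below is a toy stand-in (character trigram counts) for the latent representations the paper refers to, and the names `weak_label`, `threshold`, and `max_len`, the greedy overlap resolution, and the example skill list are all assumptions made for illustration. The sketch embeds every n-gram of a job-ad sentence, compares it to each taxonomy skill by cosine similarity, and emits BIO tags for spans above the threshold.

```python
import math
from collections import Counter


def embed(text, n=3):
    """Toy stand-in for a latent representation: character trigram counts."""
    padded = f" {text.lower()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def ngrams(tokens, max_len):
    """Yield (start, end, text) for every span of 1..max_len tokens."""
    for size in range(1, max_len + 1):
        for start in range(len(tokens) - size + 1):
            yield start, start + size, " ".join(tokens[start:start + size])


def weak_label(sentence, skills, threshold=0.8, max_len=4):
    """Tag spans whose embedding is close to any taxonomy skill (hypothetical helper)."""
    tokens = sentence.split()
    skill_vecs = {skill: embed(skill) for skill in skills}
    candidates = []
    for start, end, span in ngrams(tokens, max_len):
        vec = embed(span)
        skill, score = max(
            ((s, cosine(vec, sv)) for s, sv in skill_vecs.items()),
            key=lambda pair: pair[1],
        )
        if score >= threshold:
            candidates.append((score, start, end, skill))

    labels = ["O"] * len(tokens)
    matches = []
    # Greedy resolution: accept the best-scoring spans first, skip overlaps.
    for score, start, end, skill in sorted(candidates, reverse=True):
        if all(label == "O" for label in labels[start:end]):
            labels[start] = "B-SKILL"
            for i in range(start + 1, end):
                labels[i] = "I-SKILL"
            matches.append((start, " ".join(tokens[start:end]), skill))
    matches.sort()  # report matches in sentence order
    return labels, [(span, skill) for _, span, skill in matches]


# Example skill names are invented for this sketch, not taken from the taxonomy.
skills = ["Python programming", "project management", "welding"]
labels, matches = weak_label(
    "experience with python programming and project management", skills
)
print(list(zip(labels, matches)))
```

In a realistic setting the trigram `embed` would be replaced by a learned encoder, and the threshold would need tuning on held-out data; the weak labels produced this way can then stand in for manual annotation.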

Code & Models

Repositories

jjzha/skill-extraction-weak-supervision
Official

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques