"FIJO": a French Insurance Soft Skill Detection Dataset

David Beauchemin; Julien Laumonier; Yvan Le Ster; Marouane; Yassine

arXiv:2204.05208·cs.CL·August 1, 2022·1 cites

"FIJO": a French Insurance Soft Skill Detection Dataset

David Beauchemin, Julien Laumonier, Yvan Le Ster, Marouane, Yassine

PDF

Open Access 2 Repos

TL;DR

This paper introduces FIJO, a new public dataset of insurance job offers with soft skill annotations, and evaluates transformer-based models for skill detection using this dataset.

Contribution

The paper presents FIJO, a novel annotated dataset for soft skill detection in insurance job ads, and assesses transformer models' effectiveness on this domain.

Findings

01

Transformers achieve good token-wise performance on FIJO.

02

FIJO reveals specific challenges in soft skill detection.

03

Analysis of errors highlights future research directions.

Abstract

Understanding the evolution of job requirements is becoming more important for workers, companies and public organizations to follow the fast transformation of the employment market. Fortunately, recent natural language processing (NLP) approaches allow for the development of methods to automatically extract information from job ads and recognize skills more precisely. However, these efficient approaches need a large amount of annotated data from the studied domain which is difficult to access, mainly due to intellectual property. This article proposes a new public dataset, FIJO, containing insurance job offers, including many soft skill annotations. To understand the potential of this dataset, we detail some characteristics and some limitations. Then, we present the results of skill detection algorithms using a named entity recognition approach and show that transformers-based models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · AI and HR Technologies · Scheduling and Timetabling Solutions