Computational Job Market Analysis with Natural Language Processing
Mike Zhang

TL;DR
This paper explores NLP techniques to analyze job market data, introducing new datasets, methods for skill extraction, and domain-specific language model adaptations to improve job description understanding.
Contribution
It presents novel datasets, an active learning algorithm, and domain-specific NLP models tailored for extracting insights from job descriptions.
Findings
Effective skill extraction using weak supervision
Enhanced model performance with taxonomy-aware pre-training
Improved information retrieval from job ads
Abstract
[Abridged Abstract] Recent technological advances underscore labor market dynamics, yielding significant consequences for employment prospects and increasing job vacancy data across platforms and languages. Aggregating such data holds potential for valuable insights into labor market demands, new skills emergence, and facilitating job matching for various stakeholders. However, despite prevalent insights in the private sector, transparent language technology systems and data for this domain are lacking. This thesis investigates Natural Language Processing (NLP) technology for extracting relevant information from job descriptions, identifying challenges including scarcity of training data, lack of standardized annotation guidelines, and shortage of effective extraction methods from job ads. We frame the problem, obtaining annotated data, and introducing extraction methodologies. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
