CultureBERT: Measuring Corporate Culture With Transformer-Based Language   Models

Sebastian Koch; Stefan Pasch

arXiv:2212.00509·cs.CL·January 26, 2024·1 cites

CultureBERT: Measuring Corporate Culture With Transformer-Based Language Models

Sebastian Koch, Stefan Pasch

PDF

Open Access 1 Repo

TL;DR

This paper develops transformer-based language models to measure corporate culture from employee reviews, outperforming traditional methods and providing a new tool for analyzing organizational values and environment.

Contribution

It introduces a novel dataset of employee reviews labeled for corporate culture and fine-tunes transformer models to improve classification accuracy over traditional approaches.

Findings

01

Transformer models classify reviews 17-30% more accurately.

02

Models outperform traditional text classification methods.

03

Publicly available models facilitate future research.

Abstract

This paper introduces transformer-based language models to the literature measuring corporate culture from text documents. We compile a unique data set of employee reviews that were labeled by human evaluators with respect to the information the reviews reveal about the firms' corporate culture. Using this data set, we fine-tune state-of-the-art transformer-based language models to perform the same classification task. In out-of-sample predictions, our language models classify 17 to 30 percentage points more of employee reviews in line with human evaluators than traditional approaches of text classification. We make our models publicly available.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Stefan-Pasch/CultureBERT
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Computational and Text Analysis Methods · Topic Modeling