Continuous Output Personality Detection Models via Mixed Strategy Training

Rong Wang; Kun Sun

arXiv:2406.16223·cs.CL·June 25, 2024

TL;DR

This paper introduces a novel mixed strategy training method for personality detection models that produce continuous trait scores, leveraging a large Reddit dataset and fine-tuned RoBERTa models to outperform traditional binary classifiers.

Contribution

It presents a new approach for training personality detection models to output continuous scores, improving accuracy and applicability over traditional binary methods.

Findings

01. Models predict Big Five traits with high accuracy
02. Significant performance improvement over binary classification
03. Enhanced applications in multiple fields

Abstract

Traditional personality detection models yield only binary results. This paper presents a novel approach, based on mixed training strategies, for training personality detection models that produce continuous output values. Leveraging the PANDORA dataset, which includes extensive personality labeling of Reddit comments, we developed models that predict the Big Five personality traits with high accuracy. Our approach involves fine-tuning a RoBERTa-base model with various strategies, such as Multi-Layer Perceptron (MLP) integration and hyperparameter tuning. The results demonstrate that our models significantly outperform traditional binary classification methods, offering precise continuous outputs for personality traits and thus enhancing applications in AI, psychology, human resources, marketing, and health care.
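To make the contrast between continuous and binary outputs concrete, here is a minimal sketch of an MLP head that maps a pooled text embedding to one score per Big Five trait. It is not the authors' implementation. The layer sizes, the random weights, the sigmoid squashing to (0, 1), the 0.5 threshold, and all function names are assumptions chosen for illustration, and a random vector stands in for the encoder output that a fine-tuned RoBERTa-base model would provide.

```python
import math
import random

TRAITS = ["openness", "conscientiousness", "extraversion",
          "agreeableness", "neuroticism"]

def make_layer(n_in, n_out, rng):
    """Return (weights, biases) for a dense layer with small random weights."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def dense(x, layer):
    """Apply a dense layer: one weighted sum plus bias per output unit."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]

def relu(x):
    return [max(0.0, v) for v in x]

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def mlp_head(embedding, hidden_layer, output_layer):
    """Map a pooled embedding to one continuous score in (0, 1) per trait."""
    hidden = relu(dense(embedding, hidden_layer))
    logits = dense(hidden, output_layer)
    return {trait: sigmoid(z) for trait, z in zip(TRAITS, logits)}

def to_binary(scores, threshold=0.5):
    """Collapse continuous scores to the binary labels a classifier would give."""
    return {trait: int(s >= threshold) for trait, s in scores.items()}

rng = random.Random(0)
EMBED_DIM, HIDDEN_DIM = 16, 8  # toy sizes (assumption); RoBERTa-base uses 768-dim hidden states
hidden_layer = make_layer(EMBED_DIM, HIDDEN_DIM, rng)
output_layer = make_layer(HIDDEN_DIM, len(TRAITS), rng)

# Stand-in for the encoder's pooled output (assumption: untrained, random).
embedding = [rng.gauss(0.0, 1.0) for _ in range(EMBED_DIM)]

scores = mlp_head(embedding, hidden_layer, output_layer)
labels = to_binary(scores)
```

Here `scores` keeps a graded value for each trait, while `labels` discards everything except which side of the threshold the score falls on. In a trained model the head's weights would be learned jointly with the encoder rather than drawn at random.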

Code & Models

Models

KevSun/Personality_LM (Hugging Face model; 525 downloads, 27 likes)

Taxonomy

Topics: Anomaly Detection Techniques and Applications · Software Engineering Research · Sports Analytics and Performance