Is Personality Prediction Possible Based on Reddit Comments?
Robert Deimann, Till Preidt, Shaptarshi Roy, Jan Stanicki

TL;DR
This study investigates whether MBTI personality types can be predicted from Reddit comments using BERT-based classifiers, and reports potential in the classification despite problems with the unfiltered dataset.
Contribution
It introduces a method to classify MBTI personality types from Reddit comments using supervised learning with BERT, exploring the correlation between text and personality.
Findings
Potential for personality classification from Reddit comments
Challenges due to dataset quality
BERT classifiers show promising results
Abstract
In this assignment, we examine whether a person's personality type correlates with the texts they write. To do this, we aggregated datasets of Reddit comments labeled with the author's Myers-Briggs Type Indicator (MBTI) and built several supervised classifiers based on BERT that try to predict an author's personality from a given text. Despite issues caused by the unfiltered nature of the dataset, we observe potential in the classification.
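The abstract does not say how the MBTI labels were encoded for the classifiers. As a minimal sketch, assuming the standard four MBTI dichotomies (I/E, N/S, T/F, J/P), the Python below shows two common ways such labels could be framed for supervised learning: a single 16-way class index, or four independent binary labels. The names `AXES`, `to_class_id`, and `to_binary_axes` are hypothetical and are not taken from the paper.

```python
from itertools import product

# The four MBTI dichotomies in conventional letter order.
# Assumption: the paper's exact label encoding is not stated in the abstract.
AXES = [("I", "E"), ("N", "S"), ("T", "F"), ("J", "P")]

# All 16 type strings (e.g. "INTJ") in a fixed order for a 16-way classifier.
TYPES = ["".join(letters) for letters in product(*AXES)]
TYPE_TO_ID = {t: i for i, t in enumerate(TYPES)}


def to_class_id(mbti: str) -> int:
    """Map a type string to one class index (16-way framing)."""
    return TYPE_TO_ID[mbti.upper()]


def to_binary_axes(mbti: str) -> list[int]:
    """Map a type string to four 0/1 labels, one per dichotomy.

    0 means the first letter of the pair in AXES, 1 means the second.
    """
    mbti = mbti.upper()
    if mbti not in TYPE_TO_ID:
        raise ValueError(f"not an MBTI type: {mbti!r}")
    return [pair.index(letter) for pair, letter in zip(AXES, mbti)]
```

For example, `to_binary_axes("ENFP")` yields one label per dichotomy, which would suit four separate binary classifiers, while `to_class_id("ENFP")` yields a single index suited to one softmax output over all 16 types. Which framing fits better depends on how balanced the types are in the data.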
Taxonomy
Topics: Advanced Text Analysis Techniques · Personality Traits and Psychology
Methods: Attention Is All You Need · Softmax · Linear Layer · Dropout · Adam · Layer Normalization · Weight Decay · Dense Connections · WordPiece
