Multilevel Sentence Embeddings for Personality Prediction
Paolo Tirotta, Akira Yuasa, Masashi Morita

TL;DR
This paper introduces a hierarchical sentence embedding method that efficiently captures multilevel structures and polarity, improving personality prediction across multiple datasets with a single model.
Contribution
A novel two-step hierarchical training approach for sentence embeddings that reduces computational costs and enhances performance in personality prediction tasks.
Findings
Single model outperforms class-specific models
Effective across multilingual Twitter datasets
Improves hierarchical sentence representation
Abstract
Representing text into a multidimensional space can be done with sentence embedding models such as Sentence-BERT (SBERT). However, training these models when the data has a complex multilevel structure requires individually trained class-specific models, which increases time and computing costs. We propose a two step approach which enables us to map sentences according to their hierarchical memberships and polarity. At first we teach the upper level sentence space through an AdaCos loss function and then finetune with a novel loss function mainly based on the cosine similarity of intra-level pairs. We apply this method to three different datasets: two weakly supervised Big Five personality dataset obtained from English and Japanese Twitter data and the benchmark MNLI dataset. We show that our single model approach performs better than multiple class-specific classification models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Topic Modeling
