Enhancing Apparent Personality Trait Analysis with Cross-Modal Embeddings
\'Ad\'am Fodor, Rachid R. Saboundji, Andr\'as L\H{o}rincz

TL;DR
This paper introduces a multimodal deep learning approach with cross-modal embeddings to improve automatic personality trait analysis from short videos, addressing data imbalance and achieving higher accuracy.
Contribution
It proposes a Siamese-extended neural network utilizing modality-invariant embeddings for better personality trait prediction from multimodal data.
Findings
Achieves 0.0033 MAE improvement over baseline
Effectively handles under-represented extreme trait values
Demonstrates superior performance with multimodal data
Abstract
Automatic personality trait assessment is essential for high-quality human-machine interactions. Systems capable of human behavior analysis could be used for self-driving cars, medical research, and surveillance, among many others. We present a multimodal deep neural network with a Siamese extension for apparent personality trait prediction trained on short video recordings and exploiting modality invariant embeddings. Acoustic, visual, and textual information are utilized to reach high-performance solutions in this task. Due to the highly centralized target distribution of the analyzed dataset, the changes in the third digit are relevant. Our proposed method addresses the challenge of under-represented extreme values, achieves 0.0033 MAE average improvement, and shows a clear advantage over the baseline multimodal DNN without the introduced module.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersonality Traits and Psychology · Cognitive Abilities and Testing
MethodsMasked autoencoder
