Cross-Lingual Attention Distillation with Personality-Informed Generative Augmentation for Multilingual Personality Recognition

Jing Jie Tan; Ban-Hoe Kwan; Danny Wee-Kiat Ng; Yan-Chai Hum; Noriyuki Kawarazaki; Kosuke Takano

arXiv:2604.08851·cs.CL·April 13, 2026

Cross-Lingual Attention Distillation with Personality-Informed Generative Augmentation for Multilingual Personality Recognition

Jing Jie Tan, Ban-Hoe Kwan, Danny Wee-Kiat Ng, Yan-Chai Hum, Noriyuki Kawarazaki, Kosuke Takano

PDF

1 Repo

TL;DR

This paper introduces ADAM, a novel multilingual personality recognition approach that combines cross-lingual attention distillation with personality-informed generative augmentation, significantly improving performance across multiple languages.

Contribution

The paper presents a new method integrating personality-guided data augmentation and cross-lingual attention distillation for enhanced multilingual personality recognition.

Findings

01

CLAD outperforms standard BCE across all languages and traits.

02

Augmentation with PIGA improves recognition accuracy.

03

Model achieves benchmark performance comparable to leading encoder models.

Abstract

While significant work has been done on personality recognition, the lack of multilingual datasets remains an unresolved challenge. To address this, we propose ADAM (Cross-Lingual (A)ttention (D)istillation with Personality-Guided Generative (A)ugmentation for (M)ultilingual Personality Recognition), a state-of-the-art approach designed to advance multilingual personality recognition. Our approach leverages an existing English-language personality dataset as the primary source and employs a large language model (LLM) for translationbased augmentation, enhanced by Personality-Informed Generative Augmentation (PIGA), to generate high-quality training data in multiple languages, including Japanese, Chinese, Malay, and French. We provide a thorough analysis to justify the effectiveness of these augmentation techniques. Building on these advancements, ADAM integrates Cross-Lingual Attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://research.jingjietan.com/?q=ADAM
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.