Assessment and manipulation of latent constructs in pre-trained language   models using psychometric scales

Maor Reuben; Ortal Slobodin; Aviad Elyshar; Idan-Chaim Cohen; Orna; Braun-Lewensohn; Odeya Cohen; Rami Puzis

arXiv:2409.19655·cs.CL·January 14, 2025

Assessment and manipulation of latent constructs in pre-trained language models using psychometric scales

Maor Reuben, Ortal Slobodin, Aviad Elyshar, Idan-Chaim Cohen, Orna, Braun-Lewensohn, Odeya Cohen, Rami Puzis

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a method to assess and manipulate latent psychological constructs in language models using reformulated psychometric questionnaires, enabling better interpretability and control of AI behavior.

Contribution

It presents a novel approach to evaluate and modify psychological traits in language models through psychometric scales reformulated as natural language inference prompts.

Findings

01

Human-like mental health constructs found in models

02

Psychometric assessment correlates with human psychology theories

03

Strategies to mitigate undesirable biases in models

Abstract

Human-like personality traits have recently been discovered in large language models, raising the hypothesis that their (known and as yet undiscovered) biases conform with human latent psychological constructs. While large conversational models may be tricked into answering psychometric questionnaires, the latent psychological constructs of thousands of simpler transformers, trained for other tasks, cannot be assessed because appropriate psychometric methods are currently lacking. Here, we show how standard psychological questionnaires can be reformulated into natural language inference prompts, and we provide a code library to support the psychometric assessment of arbitrary models. We demonstrate, using a sample of 88 publicly available models, the existence of human-like mental health-related constructs (including anxiety, depression, and Sense of Coherence) which conform with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cnai-lab/qlatent
pytorchOfficial

Videos

Assessment and manipulation of latent constructs in pre-trained language models using psychometric scales· underline

Taxonomy

TopicsNatural Language Processing Techniques

MethodsLib