PhDGPT: Introducing a psychometric and linguistic dataset about how large language models perceive graduate students and professors in psychology
Edoardo Sebastiano De Duro, Enrique Taietta, Riccardo Improta, Massimo, Stella

TL;DR
This paper introduces PhDGPT, a large dataset and framework capturing how GPT-3.5 perceives and simulates the psychometric and linguistic aspects of graduate students and professors in psychology, revealing both capabilities and limitations.
Contribution
It presents a novel dataset and prompting framework that model the machine psychology of academic figures, integrating psychometric scores with linguistic explanations for the first time.
Findings
LLMs can reconstruct human DASS factors with up to 80% accuracy.
Simulated academics show similar psychometric network structures to humans.
LLMs' explanations are less concrete and imageable for anxiety items.
Abstract
Machine psychology aims to reconstruct the mindset of Large Language Models (LLMs), i.e. how these artificial intelligences perceive and associate ideas. This work introduces PhDGPT, a prompting framework and synthetic dataset that encapsulates the machine psychology of PhD researchers and professors as perceived by OpenAI's GPT-3.5. The dataset consists of 756,000 datapoints, counting 300 iterations repeated across 15 academic events, 2 biological genders, 2 career levels and 42 unique item responses of the Depression, Anxiety, and Stress Scale (DASS-42). PhDGPT integrates these psychometric scores with their explanations in plain language. This synergy of scores and texts offers a dual, comprehensive perspective on the emotional well-being of simulated academics, e.g. male/female PhD students or professors. By combining network psychometrics and psycholinguistic dimensions, this study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Educational Strategies and Epistemologies · Text Readability and Simplification
Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Cosine Annealing · Linear Layer · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Cosine Annealing · Layer Normalization · Dropout · Multi-Head Attention
